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TITLE OF THE INVENTION 

METHOD OF DETERMINING MOVEMENT SEQUENCE, 
ALIGNMENT APPARATUS, METHOD AND APPARATUS OF DESIGNING 
OPTICAL SYSTEM, AND MEDIUM IN WHICH PROGRAM REALIZING 
5 THE DESIGNING METHOD 

BAGROUND O F THE INVENTION 

Field of the Invention 

The present invention relates to a determining 
method of movement sequence and an alignment apparatus, 

10 for example, for reducing the time of alignment 

between a pattern of an original plate and marks on a 
substrate in exposure apparatus, to a designing method 
and apparatus of an optical system such as a 
projection optical system of the exposure apparatus or 

15 a lens system for camera, and to a medium in which a 

program for realizing the designing method is recorded. 
Related Background Art 

In general, the exposure apparatus is arranged 
in such a way that, before carrying out exposure of 

20 the second layer or a layer thereafter into chip areas 

(or shot areas) on a wafer (photosensitive substrate) 
in which predetermined circuit patterns are to be 
formed, alignment is accomplished between a pattern of 
an original plate for the second or subsequent layer 

25 and the chip areas by use of EGA (a statistical 

arithmetic method). The EGA (Enhanced Global 
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Alignment) is a technique for measuring positions of 
alignment marks (measured areas) provided mainly in 
the peripheral area of a plurality of selected chip 
areas to obtain a residual rotation error of the wafer, 
linear expansion or contraction of the wafer, an 
offset of the wafer, etc. and, based thereon, aligning 
tt^'^l chip areas of the wafer, for example, as 
disclosed in Japanese Laid-open Patent Application No. 
Sho 61-44429. As another technique of a further 
development of the EGA, United States Patent No. 
4,780,617 discloses an alignment technique for 
obtaining a residual rotation error of each chip area 
itself, an orthogonality error of the chip areas, and 
linear expansion or contraction of each chip area 
itself and performing alignment so as to minimize even 
these errors. 

Particularly, when the technique of Japanese 
Laid-open Patent Application No. Hei 6-275496 is 
applied,, because of the many alignment marks to be 
measured, the measurement time will be veiry long 
unless the alignment marks are measured as efficient 
as possible. For example, let us consider an example 
in which there are 7 6 exposed chip areas (areas 
indicated by number 01 to number 7 6 in the figure) in 
the first layer of the wafer W and four alignment 
marks are provided for each of the chip areas, as 


NIK97-13US 


shown in Fig. 1. In this case, in the alignment of 
wafer by the EGA, the operator first selects a 
plurality of chip areas that are inside the outermost 
region and at vertices of polygon (for example, twenty 
chip areas hatched in Fig. 2), on an empirical basis. 
Coordinates of the designed center of each chip area 
(representing the position of each chip area) are 
stored in a memory of a main control system. Positions 
of four alignment marks of each chip area (defined by 
coordinates of the center thereof) are also stored in 
the memory of the main control system. Accordingly, 
the exposure apparatus was arranged to measure the 
position of each alignment mark according to the 
following movement sequence empirically seeming best, 
by executing the EGA. 

Specifically, for example, when a measuring 
point of an alignment optical system is at a start 
point ST (x: 186.5, y: 155.5), an XY stage with a — 
wafer mounted thereon moves so that a right upper 
alignment mark of a chip area closest to the start 
point ST (the chip area 64 in Fig. 2) comes to the 
measuring point of the alignment optical system (so as 
to be in the measuring area). After completion of the 
position measurement of the alignment mark, the XY 
stage moves so as to measure the positions of the four 
alignment marks counterclockwise. Next, the XY stage 
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moves so as to measure coordinates of the right upper 
alignment mark of a chip area closest clockwise (the 
chip area 63 in Fig. 1). After that, the XY stage 
moves so as to measure coordinates of the four 
alignment marks counterclockwise. Repeating this 
operation, the XY stage moves to measure the positions 
of the alignment marks of the all chip areas selected 
and return the measuring point of the alignment 
optical system to the end point EN (x: 215, y: 133). 
Of course, such controls of movement were also 
employed that the XY stage moved so as to measure the 
positions of the alignment marks of each chip area 
clockwise and that after completion of the position 
measurement of the all alignment marks of a chip area, 
the XY stage moved so as to measure the alignment 
marks of a chip area closest counterclockwise. 

However, the movement sequence of the XY stage 
in the position measurement of each alignment mark was 
determined empirically as described above, and no 
consideration was given to efficient movement control 
of the XY stage for the position measurement of each 
alignment mark. 

The reason is that there arises the following 
problem in obtaining the movement sequence of the XY 
stage for the position measurement of alignment marks 
using the statistical measurement process such as the 
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EGA. For example, where there are n alignment marks to 
be measured on the wafer / the number of conceivable 
stage movements for movement between alignment marks 
is at most = n(n - 1) (even though the turnaround 
5 time differs depending upon the positive or negative 

movement direction of the stage) and computation 
thereof can be done quickly. Therefore ^ the overall 
turnaround time is determined uniquely as soon as the 
measurement process order is determined. However, 

10 there are n! way as to the order for the measurement 

process of n alignment marks ^ and the computation time 
becomes too long when the all possible solutions are 
computed using the producing and checking method of 
the all conceivable orders- Particularly, if n > 13, 

15 the computation is practically impossible ("Practical 

Course: Invitation to Traveling-Salesman Problems I, 
II, III," Operations Research 39 (1994), No. 1: pp 25- 
31, No. 2 : pp 91-96 ^ No. 3 : pp 156-162). Accordingly, 
the conventional alignment methods did not involve a 

20 step of finding the optimum movement sequence under 

practical operation conditions. 

NOW/ let us focus attention on the optical 
system such as the projection optical system of the 
aforementioned exposure apparatus. The designing of 

25 the optical system including lens elements has been 

known heretofore and is known as a very difficult 
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issue. This is because various factors/ such as 
multiple dimensions / a super-multimodal property , 
strong dependent relation between variables, or 
complex constraints / make the issue tough. In addition, 
as criteria for evaluation of the optical system to be 
designed there exist numerous evaluation criteria such 
as the Seidel's five aberrations, the size, or the 
cost. 

In the conventional designing method of optical 
system the basic search is a local search in the 
neighborhood of an initial or starting solution. If 
the initial solution is not appropriate, the result 
will fall into a local solution, so that the search 
will end unsuccessfully. It was thus the conventional 
practice to employ a method for changing the initial 
solution in a trial and error manner in order to find 
an optical system having the aimed performance. Since 
the conventional search basically allowed optimization 
of only one evaluation criterion, the designing 
process was changed to a single-objective process by 
setting a tradeoff ratio, in spite of the many 
evaluation criteria (Yoshiya Matsui: Lens Designing 
Method, Kyoritsu shuppan (1986); Jihei Nakagawa: Lens 
Design Engineering, Tokai daigaku shuppankai (1986); 
Toru Kusakawa: Lens Optics, Tokai daigaku shuppankai 
(1988)). 
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It is not possible to preliminarily know the 
tradeoff ratio for obtaining the optical system having 
the aimed performance. It is thus the present status 
that loads on experts are very heavy in the search for 
the initial solution for local search and in the 
search for the tradeoff ratio between the evaluation 
criteria. 

Further, the conventional method for modifying 
the optical system is a method for, with data of one 
optical system preliminarily given as initial data by 
the designer, altering plural parameters, including 
radii of curvatures of boundary surfaces in respective 
optical elements (lens elements, reflectors, etc.) 
belonging to this optical system, distances between 
the boundary surfaces, and refractive indices of 
spaces (the lens elements and aerial lenses between 
the lens elements) located between the boundary 
surfaces, using an index of increase or decrease of a 
performance function indicating the performance of the 
lens optical system at that time. 

Then the same improving procedure is repeated 
using the data of the optical system represented by 
the plural parameters after the alteration, as a new 
solution (i.e., the optical system to be improved). 
For example, if good or bad optical performance is 
reflected to increase or decrease of the performance 
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function/ the plural parameters will be altered so as 
to increase the performance function and updated each 
to the parameters after the alteration ^ as values of 
new parameters. 

On the other hand, the genetic algorithm (GA) is 
known as one of optimization techniques, which 
imitates the evolutionary process of organism on an 
engineering basis. This genetic algorithm (hereinafter 
referred to as GA) is a generate and test method, 
which is characterized in that the essential point is 
only that dominance can be evaluated between two 
solution candidates. Therefore, it does not require 
the condition of differentiability of the performance 
function or the like and is thus effective to problems 
with complex constraints. The GA also has the feature 
of performing a search using a population of plural 
solution candidates and is drawing attention as a 
global search technique. Further, the GA is also 
drawing attention as a multi-objective optimization 
technique for handling the plural evaluation criteria 
explicitly and finding a Pareto optimal solution set 
by a single search. 

For example, M. WALK AND J. NIKLAUS, "Some 
Remarks on Computer-Aided Design of Optical Lens 
System" (JOURNAL OF OPTIMIZATION THEORY AND 
APPLICATION: Vol. 59, No. 2, pp. 173-181, NOVEMBER 
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1988) and X- CHEN AND K. YAMAMOTO, "Genetic algorism 
and its application in lens design" (SPIE, Vol, 2863/ 
PP, 216-221) describe the technology of application of 
the above GA to the design of optical system. 

5 SUMMARY O F THE TNVENTIQN 

The present invention concerns a movement 
control of a stage with a wafer mounted thereon for 
measuring positions of plural measurement target areas 
(including alignment areas) provided on the wafer in a 

10 shorter time^ in alignment between a photomask or a 

reticle (hereinafter referred to generally as 
"reticle") and each chip area on the wafer, and more 
particularly, the invention relates to a determining 
method of movement sequence and an alignment apparatus 

15 to minimize the overall turnaround time of the stage 

movement associated with the sequential measurement 
process. 

The inventors excunined the conventional 
alignment technology described above and found the 
20 following issues. 

First, the determining method of movement 
sequence in the measurement process carried out for 
alignment must obtain an optimum solution or a near- 
optimum solution to the permutation optimization 
25 problem within a shorter computation time. As 

described above, in the case of the movement sequence 
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of the stage, an ideal process is to produce all 
possible measurement orders (movement sequences) of 
alignment mark positions and to find the shortest 
turnaround time (the overall movement time excluding 
5 the measurement times) as an optimum solution out of 

these candidates generated • However, when examination 
is made as to the order of measurement of the wafer 
having n alignment marks, nl paths must be checked 
only for the order of measurement of positions of the 

10 n alignment marks on the wafer. Especially, if n > 13, 

the computation time will become so enormous that it 
is practically impossible to obtain a solution. 
Therefore, in order to increase the throughput of the 
EGA, it is necessary to obtain the optimum solution or 

15 the near-optimum solution of the movement sequence of 

measurement process more efficiently. 

Second, the deteirmination of movement sequence 
of measurement process needs to take account of the 
tradeoff between the permissible computation time for 

20 obtaining the optimum solution or the near-optimum 

solution and the quality of the obtained solution. 
Specifically, the computation for obtaining the 
optimum solution of the movement sequence for. the 
position measurement of alignment marks is carried out 

25 on the occasion of exchange of reticles of different 

exposure patterns or on the occasion of exchange of 
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wafers from the reason that positions for the 
measurement process are designated arbitrarily wafer 
by wafer, even though they have a common exposure 
pattern. Therefore / permissible computation times vary 
5 depending upon the circumstances. For example, the 

time for exchange of reticles and alignment of the 
reticle (the reticle loading time) is normally 20 sec 
or so. The time for loading a first wafer in a certain 
lot (the wafer loading time) is normally 5 sec or so. 

10 An object of the present invention is, therefore, 

to provide a determining method of movement sequence 
for obtaining a solution of a preferable movement 
sequence within a short time and an alignment 
apparatus provided with an arithmetic unit for 

15 carrying out the determining method. The determining 

method of movement sequence according to the present 
invention is a method to increase the throughput of 
EGA, in which a near-optimum solution of the movement 
sequence is first obtained within a very short 

20 computation time, then solutions of movement sequences 

to make shorter the movement time for the measurement 
process of alignment mark positions are successively 
produced as long as the computation time allows, and 
the optimum isolution of the movement sequence is 

25 generated finally (if the sufficient, permissible 

computation time is given) . This can provide a 
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solution in a quality consistent with the permissible 
computation time given (the better the longer the 
permissible computation time), depending upon the 
circumstances of computational resources that can be 
5 used. 

The determining method of movement sequence 
according to the present invention is carried out 
prior to an alignment step of performing alignment 
between a transfer position of a pattern of an 

10 original plate (a mask or a reticle) and each chip 

area on a substrate (wafer) ^ the alignment step being 
a step of measuring positions of measurement target 
areas while successively moving the measurement target 
areas (alignment marks) on the substrate into a 

15 measuring area of a measuring system (an alignment 

optical system) . The method according to the present 
invention determines the movement sequence indicating 
the measurement order of the alignment marks within a 
shorter time, thereby drastically increasing the 

20 throughput of the EGA. 

Specif ically, the determining method of movement 
sequence according to the present invention comprises 
an arithmetic step of obtaining a solution of a most 
preferable movement sequence with respect to an 

25 overall movement time between the measurement target 

areas by use of a predetermined search technique. This 
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arithmetic step includes at least a first step of 
generating a group including a plurality of executable 
movement sequences out of a group of movement sequence 
candidates each indicating a measurement order of 
5 plural measurement target areas, and a second step of 

selecting a movement sequence that completes the 
movement operation between the plural target areas 
within the shortest time out of the group thus 
generated . 

10 Further, the determining method of movement 

sequence according to the present invention comprises 
a pre-step executed prior to the above arithmetic step, 
the pre-step being a step of producing a movement time 
management table in which for each of the plural 

15 measurement target areas, a movement time is recorded 

as a time necessary for movement of the target area of 
interest from a position thereof at the time of 
completion of position measurement of either one of 
the plural measurement target areas into the measuring 

20 area of the measuring system. This movement time 

management table also includes such information that 
for a pair of measurement target areas selected out of 
the plurality of measurement target areas, after 
completion of the position measurement of one 

25 measurement target area selected, the other 

measurement target area selected is prohibited from 
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moving from a position thereof at the time of 
completion of the position measurement of the one 
measurement target area into the measuring area of the 
measuring system. 

Particularly, the search technique executed in 
the above arithmetic step includes at least either one 
of a method based on operations-research technique, a 
genetic algorithm (hereinafter referred to as GA) as 
an evolutionary computation method, and a combination 
thereof. The method based on operations-research 
technique includes at least either one of a linear 
programming method, a Lin and Kernighan's approach 
(hereinafter referred to as LK method), and a k-OPT 
method . 

The above GA can include as a genetic operator a 
search technique like an improving method such as the 
above method based on operations-research technique 
(the linear programming method (NN Method), the LK 
method (S. Lin and B. W. Kernighan, An Effective 
Heuristic Algorithm for the Traveling Salesman Problem, 
Operations Research 21 (1973) pp 498-516), or the k- 
OPT method (including the 2-OPT method and 3-OPT 
method) ("Practical Course: Invitation to the. 
Traveling-Salesman Problems I, II, III," Operations 
Research 39 (1994) No. 1: pp 25-31, No. 2: pp 91-96, 
No. 3: pp 156-162). By this, a solution of a most 
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preferable movement sequence out of solutions found at 
that time can be always obtained even on the way of 
computation for seeking the movement sequence. If a 
further computation time is given , a solution of a 
more preferable movement sequence can be obtained. 
Therefore, according to the present invention, a good 
solution of a movement sequence consistent with the 
computation time permitted can be obtained even if the 
computation is interrupted or even if a limitation is 
preliminarily imposed on the computation time. 

On the other hand, the alignment apparatus 
according to the present invention comprises at least 
a measuring device (measuring system) for measuring 
each of positions of plural alignment marks, a moving 
device for effecting relative movement between the 
plural alignment marks and a measuring area of the 
measuring device, an arithmetic section for carrying 
out the above-stated determining method of movement 
sequence, and a control section for controlling the 
moving device so as to move the plural alignment marks 
successively into the measuring area of the measuring 
device, according to a solution of a movement sequence 
obtained by the arithmetic section. 

Further, the alignment apparatus according to 
the present invention comprises a memory for storing a 
movement time management table in which for each of 

15 
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the plural alignment marks a movement time is recorded 
as a time necessary for movement of the alignment mark 
of interest from a position thereof at the time of 
completion of position measurement of either one of 
5 the plural alignment marks into the measuring area of 

the measuring device. The movement time management 
table stored in this memory also includes such 
information that for a pair of alignment marks 
selected out of the plural alignment marks, after 

10 completion of the position measurement of one 

alignment mark selected, the other alignment mark 
selected is prohibited from moving from a position 
thereof at the time of completion of the position 
measurement of the one alignment mark into the 

15 measuring area of the measuring device. 

The GA applied as the above search technique is 
used, for example, as an approach for the case to find 
- the shortest-length path in visiting each city only 
once and all cities, but there has been and is no 

20 example of application to optimization of the movement 

sequence for the position measurement of alignment 
marks in the steppers or the like. 

As described above, in the present invention, 
the computation of optimization (how efficiently the 

25 XY stage is moved) of the movement sequence for the 

position measurement by the search technique (the 
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linear programming method, the LK method, the k-OPT 
method, or the GA) is carried out prior to the 
measurement of the positions of plural alignment marks. 
Based on the computation result, the alignment 
5 apparatus according to the present invention operates 

to move the XY stage so as to bring each of the 
alignment marks into the measuring area of the 
alignment optical system. Then the position of each 
alignment mark is measured by the alignment optical 

10 system. 

Since the movement sequence for alignment was 
determined empirically before, it was not always the 
movement sequence for good or optimal alignment. In 
contrast, the present invention allows us to find a 

15 good or optimal movement sequence and can thus 

increase the throughput of EGA. 

Meanwhile, in the case of the conventional 
designing method and modifying method of optical 
system described previously, once the performance 

20 function reaches a locally maximal value, no further 

improvement is made even by execution of further 
improving procedures. An optical system associated 
with the parameters obtained under such circumstances 
is a locally optimal solution or a locally optimal 

25 optical system and a possibility that it is a true 

optimal solution or a globally optimal solution, is 
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very low. Accordingly^ the performance of the optical 
system as a solution achieved by the conventional 
designing method and modifying method of optical 
system is dependent on the optical system first given 
5 as an initial solution by the designer. This means 

that a possibility that the optimal solution will not 
come out is extremely high if the modality of the 
optical system of the optimal solution is greatly 
different from that of the optical system first given 

10 as an initial solution by the designer. 

In the conventional technology for designing the 
optical system by use of the genetic algorithm, since 
the genetic operator used in the GA was able to handle 
only discrete numbers, the plural parameters featuring 

15 the optical system, which were originally continuous 

variables , needed to be digitized as discrete values 
and it was thus hard to obtain a solution with 
sufficient accuracy within a practical time. 

Specifically, the conventional technology for 

20 executing the GA in a discrete fashion often utilizes 

the coding/crossover procedures as shown in Fig. 3 to 
Fig. 5, in which each of parameters constituting a 
parent individual or a child individual is expressed 
by a binary code and in which the binary codes are 

25 used in combination with one point crossover, two 

point crossover, or uniform crossover. Fig. 3 is a 
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drawing to show a step of generating one set of 
parameters of child individuals from one set of 
parameters of parent individuals expressed by binary 
codes, by the one point crossover. Fig. 4 is a drawing 
5 to show a step of generating one set of parameters of 

child individuals from one set of parameters of parent 
individuals expressed by binary codes, by the two 
point crossover, and Fig. 5 is a drawing to show a 
step of generating one set of parameters of child 
10 individuals from one set of parameters of parent 

individuals expressed by binary codes, by the uniform 
crossover. 

In the discrete GA as described above, the phase 
structure (binary codes) of a genotype space is 

15 greatly different from that (actual values) of a 

phenotype space. This means that a character 
(property) of a parent individual becomes unlikely to 
be inherited by a child individual newly generated. 
Therefore, even if parent individuals of a certain 

20 generation approach the optimal solution, child 

individuals of the next generation could deviate from 
the optimal solution with an extremely high 
probability to be farther therefrom than their parent 
individuals. This raises the problem that wasteful 

25 searches increase considerably. 

Namely, for achieving sufficient accuracy in the 
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discrete GA, it is necessary to make the degree of 
digitization of each parameter finer. This inevitably 
increases the gene information and causes considerable 
increase of computation time. Conversely ^ if each 
5 parameter is roughly digitized in order to obtain a 

solution within a practical computation time^ there 
will arise a risk of failing to find the optimal 
solution where the optimal solution exists in a 
narrowest gap between discrete numbers. 

10 When consideration is given to the evaluation 

criteria of the optical system designed, there are 
many evaluation criteria contradicting with each other 
in the actual performance aspect (for example, 
resolution and distortion) . In either one of the 

15 conventional technology described above, a plurality 

of evaluation criteria are integrally expressed in the 
form of a performance function by preliminarily 
weighting the reciprocity among the evaluation 
criteria uniquely. 

20 However, whether the preset weighting is valid 

or not can be first checked after various optimal 
solutions have been obtained with changing weights of 
the respective evaluation criteria. A solution adapted 
for each evaluation criterion is generally called a 

25 Pareto optimal solution, and there are plural Pareto 

optimal solutions corresponding to various weightings. 
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Therefore, where there exist plural evaluation 
criteria of the optical system designed, multi- 
objective optimization needs to be effected so as to 
fit each evaluation criterion, in the case of the 
conventional designing method and modifying method of 
optical system, the resultant solution cannot be 
always determined to be the optimal solution. 

The present invention has been accomplished to 
solve the above problem. A principal object of the 
present invention is to provide a designing method and 
apparatus of optical system for obtaining an optical 
system as a globally optimal solution within a 
practical time, independent of initial data 
preliminarily given, and a medium in which a program 
for realizing the designing method is recorded. 
Specifically, the present invention relates to a 
designing method, apparatus, and so on for designing 
an optical system including at least one optical 
element (lens element, reflector, etc.), for example, 
such as a lens for photography, a lens for microscope, 
or a projection optical system in the projection 
exposure apparatus of the one-shot exposure method or 
the scanning exposure method (e.g., a step-and-repeat 
type stepper or a step-and-scan type stepper), and 
also enables optimal relocation (i.e., correction for 
locations) where the lens elements belonging to the 
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optical system to be designed are not processed 
exactly in designed values, and optimization of 
combination of lens elements where a plurality of 
optical systems including such lens elements not 
processed in the designed values are produced. 

A designing method of optical system according 
to the present invention is a method for designing an 
optical system including at least one optical element 
(a lens element, a reflector, or the like) by use of 
an evolutionary computation method (genetic algorithm), 
which is characterized in that the optical system is 
optimized by use of a genetic operator for directly 
handling continuous values. Specifically, the 
designing method of optical system comprises at least 
a selection step of selecting at least two parent 
individuals from a generation population of n 1, 
wherein n = 1 indicates an initial population) having 
a plurality of individuals of data of the optical 
system to be designed, a child generation step of 
newly generating a population consisting of a 
plurality of child individuals by applying a genetic 
operator, being at least either one of a crossover 
operator and a mutation operator, to the parent 
individuals selected, and a survival selection step of 
generating a next generation population. 

Particularly, an individual that is a candidate 
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for the optical system to be designed is given as a 
real vector having components ^ for example, including 
curvatures of boundary surfaces specifying the lens 
element, the reflector, or the like included in the 
5 optical system to be designed, a distance between the 

boundary surfaces, and a refractive index of a space 
located between the boundary surfaces, whereby mapping 
or coding is effected from the data of the optical 
system of the individual (phenotype) to the genotype. 
10 The optimization of the optical system does not have 

to be made for the all components of the real vector, 
but can be done individually for a specific vector 
component (at least one of the plural parameters 
featuring the optical system to be designed) . 
15 In the designing method of optical system 

according to the present invention, the above child 
generation step is the step of generating the child 
individuals by at least either one of- the crossover 
operator (a crossover step) and the mutation operator 
20 (a mutation step). Namely, from the inside of a 

partial space defined by a predetermined continuous 
occurrence probability distribution of occurrence 
probabilities set based on components of real vectors 
of the parent individuals selected, the crossover 
25 operator generates as a child individual a real vector 

having a value occurring according to the occurrence 
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probabilities. From the inside of a partial space 
defined by a predetermined continuous occurrence 
probability distribution of occurrence probabilities 
increasing with approaching at least one parent 
individual out of the parent individuals selected, the 
mutation operator generates as a child individual a 
real vector having a component of a value occurring 
according to the occurrence probabilities. 

In a search process in which the above selection 
step, child generation step, and survival selection 
step are executed repetitively, the child individuals 
newly generated from the parent individuals selected 
succeed to characters (properties) of their parent 
individuals, thereby avoiding wasteful searches. In 
the initial stage of the search process, the parent 
individuals are apart from each other and the 
individuals are also scattered in various spaces, 
depending upon distances between the parent " 
individuals. The search is thus done in a wide range* 
With progress in searches <with increase in the number 
of executions of the above steps), the distances 
between the parent individuals become shorter, so that 
more child individuals will be generated in a partial 
space including the optimal solution, depending upon 
the distances between the parent individuals. 

The above crossover operator is a genetic 
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operator that directly handles continuous values / 
which can be selected, for example, from UNDX (Ono, I. 
and Kobayashi, S: A Real-coded Genetic Algorithm for 
Function Optimization Using Unimodal Normal 
5 Distribution Crossover, Proceeding of 7th 

International Conference on Genetic Algorithms, pp. 
246-253 (1997)), BLX-a (N- J- Radcliffe: Formal 
Analysis and Random Respectful Recombination, 
Proceeding of the Fourth International Conference on 

10 Genetic Algorithms, pp. 222-229, 1991); NDX (I. Ono, M. 

Yamamura and S. Kobayashi: A Genetic Algorithm with 
Characteristic Preservation for Function Optimization, 
Proceedings of IIZUKA '96, pp. 511-514, 1996); or UNDX 
(Ono, I. and Kobayashi, S: A Real-coded Genetic 

15 Algorithm for Function Optimization Using Unimodal 

Normal Distribution Crossover, Proceeding of 7th 
International Conference on Genetic Algorithms, pp. 
246-253 (1997) ) . 

Further, the above survival selection step is to 

20 select individuals to be left as individuals in the 

next generation population out of the n-th generation 
population including the parent individuals and the 
population of the child individuals newly generated. 
In the above survival selection step, in order 

25 to realize the multi-objective optimization, 

individuals satisfying at least either criterion out 
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of one or two or more evaluation criteria are selected 
from the population including the child individuals 
generated. This selection of individuals is carried 
out preferably so as to select individuals to be left 
as individuals in the next generation in order in 
proportion to fitness values of the respective 
individuals from the fittest to a predetermined 
evaluation criterion. 

Further, in the designing method of optical 
system according to the present invention, the above 
survival selection step is preferably a step of 
generating the next generation population by replacing 
individuals not selected in the n-generation 
population with the selected individuals. 

This designing method of optical system 
according to the present invention can be realized in 
the form of a program described in a predeteirmined 
language, and this program" is carried out, for example, 
by a computer having basic components of an arithmetic 
circuit with memories (RAM, ROM), input/output 
sections, an arithmetic section, a memory section, and 
a control section. Particularly, when carried out by 
the computer, the program for realizing the designing 
method of optical system should be preferably 
optically or magnetically recorded in a predetermined 
recording medium such as CD, MO, FD, a hard disk, a 
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magnetic tape, or ROM. 

According to the above constitution, the present 
invention can automatically generate an optical system 
as being a globally optimal solution independent of 
the initial solution given. Namely, the invention 
makes it possible to find an optimal solution or a 
near-optimal solution of plural parameters (components 
of a real vector) featuring the optimal optical system 
within a practical time from the initial solution 
arbitrarily given. 

In the conventional designing method and 
modifying method of optical system, it is also 
conceivable to employ a method for obtaining a 
plurality of local optimal solutions by repeating such 
an operation that once a search falls in a local 
optimal solution and if little change appears in 
increase or decrease of the performance function with 
repetition of improving procedure^ the local optimal 
solution of the optical system obtained at this time 
is once stored as a solution candidate on the memory 
and that the improving procedure is carried out again 
for a new optical system to be improved, obtained by 
arbitrarily changing one or more pareuaeters out of the 
plural parameters of the solution, thereby obtaining 
another local optimal solution. However, as the 
optical system becomes more complex with increase in 
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the number of optical elements, the nvunber of local 
optimal solutions becomes so large in general. It is, 
therefore r difficult to obtain the globally optimal 
solution by this method. 

Since plural optical systems as multi-objective 
optimal solutions, which must exist in correspondence 
to a plurality of conflicting evaluation criteria, can 
be selected simultaneously in the above survival 
selection step, when a plurality of evaluation 
criteria for evaluation of optical system are given, a 
plurality of Pareto optimal solutions or a plurality 
of Pareto near-optimal solutions can be obtained 
within a practical time by performing the multi- 
objective optimization for simultaneously handling the 
plural evaluation criteria. 

Further, according to the present invention, an 
optimal solution or a near solution to the optimal 
solution of the parameters featuring the optimal, „ 
optical system can be obtained within a practical 
computation time even in a range satisfying 
constraints on (plural) specific parameters 
arbitrarily designated by the user. For example, even 
in the case where the constraints given are that the 
first surface of the first lens must be convex and 
that a refractive index of a glass material of the 
second lens must be x (e.g., 1.5266), y (e.g., 1.6010), 
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or z (e.g., 1.7294), the multi-objective optimization 
can be done within the range satisfying such 
constraints . 

Fig. 1 is a drawing to show a solution of a 
movement sequence (the first example) obtained by the 
conventional approach based on the rule of thumb; 

Fig. 2 is a drawing to show a solution of a 
movement sequence (the second example) obtained by the 
conventional approach based on the rule of thumb; 

Fig. 3 to Fig. 5 are drawings for explaining the 
coding/crossover procedures in the conventional 
designing method of optical system using the discrete 
GA; 

Fig. 6 is a drawing to show the overall 
structure of the projection exposure apparatus 
(stepper) including the alignment apparatus provided 
with the arithmetic section for realizing the 
determining method of movement sequence according to 
the present invention; 

Fig. 7 is a flowchart for explaining the 
measurement process of alignment mark position for the 
EGA arithmetic; 

Fig. 8 is a table to show the performance of the 

wafer stage etc . ; 

Fig. 9 is a table to show chip areas positions 
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of which are to be measured, and coordinates of their 
centers (representing the positions of the chip 
areas ) ; 

Fig. 10 is a table to show a movement time 
management table; 

Fig. 11 is a drawing to show locations of chip 
areas and alignment marks provided on a wafer; 

Pig. 12 is a drawing to show a first arrangement 
example of alignment marks in a chip area; 

Fig. 13 is a drawing to show a second 
arrangement example of alignment marks in a chip area; 

Fig. 14 is a diagram to show an example of 
flowchart for explaining an example of the genetic 
algorithm applied as a search technique of movement 
sequence; 

Fig. 15 is a diagram to show an example of 
flowchart for explaining another example of the 
genetic algorithm applied as a search technique of 
movement sequence; 

Fig. 16 is a drawing for explaining a crossover 
operator in the genetic algorithm; 

Fig. 17 is a drawing for explaining a mutation 
operator (cyclic shift) in the genetic algorithm; 

Fig. 18 is a drawing to show a solution of a 
movement sequence obtained by the determining method 
of movement sequence according to the present 
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invention; 

Fig. 19 is a drawing to show a solution of a 
movement sequence obtained by the determining method 
of movement sequence according to the present 
invention^ to which the linear programming method (NN 
method) is applied; 

Fig, 20 is a flowchart for explaining the 
determining method of movement sequence according to 
the present invention, to which the NN method is 
applied; 

Fig. 21 is a drawing to show a solution of a 
movement sequence obtained by the determining method 
of movement sequence according to the present 
invention, to which the LK method utilizing initial 
solutions obtained by the NN method is applied; 

Fig. 22 is a flowchart for explaining the 
determining method of movement sequence according to 
the present invention, to which the LK method is 
applied; 

Fig. 23 is a drawing to show the relation 
between the computation time of the approach based on 
the rule of thumb, the NN method, and the LK method, 
and the overall movement time of the movement 
sequences obtained thereby; 

Fig. 24 is a drawing to show the relation 
between the computation time of the GA using randomly 
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generated feasible base solutions as an initial group 
and the overall movement time of the movement sequence 
obtained; 

Fig. 25 is a drawing to show the relation 
between the computation time of the GA using plural 
solutions obtained by the NN method + the LK method as 
an initial group and the overall movement time of the 
movement sequence obtained thereby; 

Fig. 26 is a drawing to show a designing 
apparatus of optical system according to the present 
invention and a medium in which a program for 
realizing the designing method of optical system 
according to the present invention is recorded; 

Fig. 27 is a drawing for explaining the 
crossover operator in the designing method of optical 
system according to the present invention; 

Fig. 28 is a drawing for explaining a generation 

alteration model in the designing method of optical 

system (the first embodiment) according to the present 
invention; 

Fig. 29 is a drawing for explaining an example 
of the initial population in the designing method of 
optical system (the first embodiment) according to the 
pr es ent invent ion ; 

Fig. 30 is a drawing for explaining another 
example of the initial population in the designing 

32 


NIK97-13US 

method of optical system (the first embodiment) 
according to the present invention; 

Fig, 31 is a path diagram of an optical system 
(a lens system) obtained in Experiment 1 of the 
5 designing method of optical system (the first 

embodiment) according to the present invention; 

Fig. 32 is an aberration diagram of the lens 
system shown in Fig. 31; 

Fig. 33 is a drawing to show path diagrams and 
10 spot diagrams of lens systems (focal length 50 mm, 

brightness F 2.0, field angle 46'') obtained in 
Experiment 2 of the designing method of optical system 
(the first embodiment) according to the present 
invention; 

15 Fig. 34 is a drawing to show path diagrams and 

spot diagrams of lens systems (focal length 13 5 mm, 
brightness F 2.8, field angle 18.2*') obtained in 
- Experiment 2 of the designing method of optical system 
(the first embodiment) according to the present 

20 invention; 

Fig. 35 is a drawing to show path diagrams and 
spot diagrams of lens systems (focal length 20 mm, 
brightness F 5.6, field angle 92*") obtained in 
Experiment 2 of the designing method of optical system 

25 (the first embodiment) according to the present 

invention; 
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Fig. 36 is a drawing for explaining a generation 
alteration model in the designing method of optical 
system (the second embodiment) according to the 
present invention; 
5 Fig. 37 is a drawing to show a Pareto solution 

set obtained by the designing method of optical system 
(the second embodiment) according to the present 
invention; 

Fig. 38 is a drawing to show a state in which 
10 the optimal solution obtained in Experiment 1 of the 

designing method of optical system (the first 
embodiment) according to the present invention is 
plotted on Fig. 37; 

Fig. 39 is a drawing for specifically explaining 
15 parameters in the designing method of optical system 

(the third embodiment) according to the present 
invention; 

Fig. 40 is a drawing to show a converging state 
of various beams on the image plane, where the spot 
20 diagram is used/ as an example, for evaluating the 

photographic lens of Fig. 39; 

Fig. 41 is a drawing to show an example (a real 
vector) of gene representation representing a 
plurality of continuous parameters for a lens optical 
25 system in the form of continuous values, where in the 

case of Fig. 39 the field angle, focal length, and 
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refractive indices of glass materials for the 
respective lenses are preliminarily given and they are 

constraints; 

Fig, 42 is a flowchart for explaining the 
designing method of optical system (the fourth 
embodiment of the designing method of optical system 
according to the present invention); 

Fig. 43 is a drawing to schematically show 
operations in the designing method of optical system 
(the fourth embodiment) according to the present 
invention; 

Fig. 44 is a drawing for explaining the 
crossover operator where the UNDX is applied; 

Fig. 45 is a flowchart for explaining the 
operation where the multi-objective optimization is 
effected in the designing method of optical system of 
the fourth embodiment according to the present 
invention; 

Fig. 46 is a drawing to schematically show 
operations where the multi-objective optimization is 
effected in the designing method of optical system of 
the fourth embodiment according to the present 
invention; 

Fig. 47 is a path diagram of a projection 
optical system (No. 1) generated by the designing 
method of optical system (the fourth embodiment) 
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according to the present invention; 

Fig. 48 is an aberration diagram of the 
projection optical system shown in Fig. 47; and 

Fig. 49 is a path diagram of a projection 
5 optical system (No. 2) generated by the designing 

method of optical system (the fourth embodiment) 
according to the present invention. 

DRTATT^ED nflSCRTPTTON OF THR PRRFKT^PKD EMBODIMENTS 

The embodiments of the determining method of 
10 movement sequence and alignment apparatus according to 

the present invention will be described using Fig. 6 
to Fig. 25. The embodiments are examples to find the 
optimum movement sequence indicating the order of 
measurement of alignment mark positions in the EGA 
15 (Enhanced Global Alignment: aligning by the 

statistical arithmetic method) in the step-and-repeat 
type (one-shot type) projection exposure apparatus. It 
should be noted that the present invention can also be 
applied to the step-and-scan type exposure apparatus 
20 for synchronously scanning the reticle relative to the 

wafer. 

Fig. 6 shows the structure of a projection 
exposure apparatus (stepper). In this Fig. 6. the 
reticle R is illuminated under almost uniform 
25 illuminance by exposure light IL emitted from 

illumination optical system 1. The reticle R is held 
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on reticle stage 3 and the reticle stage 3 is 
supported so as to be movable and finely rotatable in 
a two-dimensional plane on base 4. Main control system 
6 for controlling the operation of the whole apparatus 
5 controls the operation of the reticle stage 3 through 

driving device 5 provided on the base 4 . 

The above main control system 6 has an 
arithmetic section 61 for carrying put the determining 
method of movement sequence according to the present 

10 invention and a memory 62 in which a movement time 

management table (see Fig. 10) is stored. This main 
control system 6 receives instructions from the 
operator or the like through input device 63 such as a 
keyboard or a pointing device. 

15 An image of a pattern of the reticle R 

illuminated by the exposure light IL is projected 
through a projection optical system PL onto each chip 
area on the wafer W. The wafer W is mounted on wafer 
stage 10 through wafer holder 9. The wafer stage 10 is 

20 composed of an XY stage for two-dimensionally aligning 

the wafer W on a plane normal to the optic axis of the 
projection optical system PL, a Z-stage for aligning 
the wafer W in a direction (Z-direction) parallel to 
the optic axis of the projection optical system PL^ a 

25 stage for finely rotating the wafer and other 

components . 
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A moving mirror 11 is fixed on the top surface 
of the wafer stage 10 and a laser interferometer 12 is 
positioned opposite to the moving mirror 11. Although 
the moving mirror 11 is illustrated in a simplified 
5 form in Fig. 6^ where the orthogonal coordinate system 

is defined with the X-axis and Y-axis in the plane 
normal to the optic axis of the projection optical 
system PL/ the moving mirror 11 is composed of a plane 
mirror having a reflective surface normal to the x- 

10 axis and a plane mirror having a reflective surface 

normal to the Y-axis. The laser interferometer 12 is 
composed of two X-axis laser interferometers for 
emitting a laser beam to the moving mirror 11 along 
the X-axis and a Y-axis laser interferometer for 

15 emitting a laser beam to the moving mirror 11 along 

the Y-axis, and the one X-axis laser interferometer 
and one Y-axis laser interferometer measure X- 
coordinate and Y-coordinate of the wafer stage 10. 

An angle of rotation of the wafer stage 10 is 

20 measured from a difference between measured values by 

the two X-axis laser interferometers . The information 
about the X-coordinate, Y-coordinate, and angle of 
rotation measured by the laser interferometer 12 is 
supplied to position measuring circuit 12a and to the 

25 main control system 6. The main control system 6 

controls the alignment operation of the wafer stage 10 
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through driving device (linear motor M) 13 while 
monitoring the coordinate data supplied. The same 
interferometer system as on the wafer side is also 
provided on the reticle side, though not illustrated 
in Fig. 6. 

Further^ the projection optical system PL of Fig. 
6 is equipped with imaging property controlling device 
14. The imaging property controlling device 14 
controls the projection magnification and distortion 
of the projection optical system PL/ for example, by 
adjusting a space between predetermined lenses out of 
a group of lenses composing the projection optical 
system PL or by adjusting the pressure inside the lens 
chamber between predetermined lenses . The main control 
system 6 also controls the operation of the imaging 
property controlling device 14. 

This embodiment uses an off -axis alignment 
optical system of an image processing type as a method 
for obtaining a position of an alignment mark 
(coordinates of the center thereof). However/ the 
position of the alignment mark can also be obtained by 
receiving diffracted light by the alignment optical 
system of the TTL (through the lens) type. 

The off -axis alignment optical system (device) 
15 is placed beside the projection optical system PL 
of Fig. 6. This alignment system 15 forms an image of 
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a cross alignment mark AM on an image pickup surface 
of an X-axis image pickup device composed of a two- 
dimensional CCD for the X-axis and also forms the 
image on an image pickup surface of a Y-axis image 
5 pickup device composed of a two-dimensional CCD for 

the Y-axis. The image of the alignment mark AM and an 
image of an index mark provided on an index screen 
(not illustrated) are superimposed on each of the 
image pickup surfaces of the image pickup devices. 

10 Image signals of the image pickup devices are supplied 

to the position measuring circuit 12a ^ which 
calculates amounts of positional deviation in the X- 
axis direction and in the Y-axis direction between the 
image of the alignment mark AM and the image of the 

15 index mark. 

Accordingly/ in Fig. 6, the position measuring 
circuit 12a obtains coordinates of the alignment mark 
AM on the stage coordinate system (Xr Y) from -the 
positional relation between the image of the alignment 

20 mark AM on the wafer W and the index mark on the index 

screen and the measurement results of the laser 
interferometer 12 in that case, and supplies the 
coordinate data thus measured to the main control 
system 6. 

25 Fig. 7 is a flowchart for explaining the 

position measurement of alignment mark for the EGA 
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arithmetic being the statistical arithmetic method* 
The movement control of the XY stage for the position 
measurement of alignment mark will be described along 
this flowchart. 

5 First, the operator selects a plurality of chip 

areas for the EGA measurement from the all chip areas 
in the wafer W and the data is supplied to the main 
control system 6 (step STlOl), The operator is allowed 
to select the all chip areas. 

10 Then the operator determines which or how many 

marks should be measured out of the plural alignment 
marks AM mainly provided in the peripheral region of 
each chip area, and the information is supplied to the 
main control system 6 (step ST103). In this embodiment 

15 there are four cross alignment marks AM (which permit 

two-dimensional coordinate measurement) positioned per 
each chip area. 

The present invention permits use of an 

arrangement in which two alignment marks AM are 

20 positioned per each chip area or an arrangement of a 

linear alignment mark AM (permitting one-dimensional 
coordinate measurement) for each chip area. It is 
necessary to obtain coordinates of ten alignment marks 
AM on a one-dimensional basis, in order to obtain (1) 

25 residual rotation error G of the wafer, (2) 

orthogonality error W of the stage coordinate system 


41 



NIK97-13US 

(or shot arrangement), (3) linear expansion or 
contraction Rx, Ry of the wafer, (4) offset 
(translation) OX, OY of the wafer (the center 
position), (5) residual rotation error 6 of the 
5 circuit pattern (chip pattern) on each shot area of 

the wafer, (6) orthogonality error w of the coordinate 
system (chip pattern) on the wafer, and (7) linear 
expansion or contraction rx, ry in two orthogonal 
directions of the chip pattern (see Japanese Laid-open 

10 Patent Application No. Hei 6-275496). 

Next, the main control system 6 sends a command 
to a loader (not illustrated) to make the loader mount 
(or load) the wafer W to be exposed on the wafer 
holder 9 (step ST105). In this case, the XY stage 10 

15 moves to a predetermined mount position for receiving 

the wafer W. 

After the wafer W is mounted on the wafer holder 
9 to be sucked by vacuum, the main control system 6 
moves the XY stage 10 so as to locate the measuring 

20 point (representing the measuring area) of the 

alignment optical system 15 at the start point ST, for 
carrying out the EGA (step ST107). 

In the main control system 6, almost at the same 
time as the arithmetic section 61 sends the load 

25 command of wafer W described above, the arithmetic 

section 61 starts finding a sequence of a short time 
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indicating a measurement order of the chip areas and 
alignment marks AM given in steps STIOI and ST103 by a 
search technique (a linear programming method, a Lin 
and Kernighan's approach/ a K-Opt method, or a genetic 
5 algorithm) (step ST 109 )• The arithmetic executed in 

this arithmetic section 61 will be described 
hereinafter. Designed center coordinates of the 
respective chip areas and designed coordinates of the 
respective alignment marks AM are preliminarily 

10 recorded in the memory 63 in the main control system 6. 

Next, in step STlll, the main control system 6 
moves the XY stage according to the measurement order 
of the alignment marks AM obtained by the genetic 
algorithm in the arithmetic section 61. A position of 

15 each alignment mark AM is measured by the alignment 

optical system, whereby an amount of positional 
deviation thereof is measured relative to the index 
mark* At this time values of the laser interferometer 
12 are also read, whereby coordinates of the alignment 

20 mark AM are obtained on the stage coordinate system (X, 

Y) by the position measuring circuit 12a. The 
coordinate values measured are supplied to the main 
control system 6 . 

At the time of completion of the measurement of 

25 positions of the all alignment marks as being 

measurement targets, the main control system 6 moves 
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the XY stage so as to move the measuring point of the 
aligrunent optical system 15 to the end point EN (step 
ST113), 

The arithmetic section of the main control 
system 6 further performs the EGA arithmetic ^ based on 
the information about the positions of the plural 
alignment marks measured, thereby obtaining the error 
parameters, expansion or contraction of the wafer, 
rotation of each chip area, etc, (step ST115), 

Then the main control system 6 moves the XY 
stage 10 to coordinates of each chip area to be 
exposed, according to the computation result of EGA or 
compensates for expansion or contraction of chip area 
or the like by the imaging property controlling device 
14, and then performs exposure (step ST117). 

Next described is the optimization of movement 
sequence for the position measurement of alignment 
marks. A path (order) of the position measurement of 
alignment marks is obtained in the arithmetic section 
61 in the main control system 6, and preconditions 
necessary for the arithmetic will be described first. 

Fig. 8 to Fig. 13 are drawings to illustrate the 
preconditions necessary for obtaining the movement 
sequence for the position measurement of alignment 
marks, in order to compare the effect of the present 
invention with the conventional technology. Of course. 
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this is just an example / and it should be noted that 
the conditions including the size of chip area^ the 
number of chips ^ the positions of alignment marks, the 
number of alignment marks, and so on can be changed 
freely. 

Fig. 8 shows data of the stepper used in the 
present embodiment, including the start point ST and 
end point EN of the wafer stage 10. 

Accelerations and maximum speeds of movement of 
the wafer stage 10 are as shown in Fig. 8/ because the 
time of movement in the X-direction is different from 
that in the Y-direction because of the difference 
between the weights of the Y-stage, mechanisms, and so 
on. 

In this embodiment, the time necessary for 
measurement of a position of one alignment mark is set 
to 0.5 sec, as shown in Fig. 8. 

Fig. 9 shows coordinates of centers of twenty 
chips (X, Y in units of mm) denoted by nximerals 12, 13, 
14, 15, 21, 22, 25, 26, 31, 36, 41, 46, 51, 52, 55, 56, 
62, 63, 64, 65 in the coordinate system on the wafer, 
and the twenty chip areas are those measured in this 
embodiment. 

Fig. 10 is a drawing to show the movement time 
management table preliminarily stored in the memory 62. 
From this movement time management table, moving times 
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between alignment marks are found where four alignment 
marks are provided for each of the twenty chip areas 
being measurement targets. For example, in this 
movement time management table, the time indicated by 
F in the drawing represents a time necessary for the 
third alignment mark in the chip area 12 on the wafer 
W to move from a position thereof at the time of 
completion of the measurement of the position of the 
second alignment mark of the chip area 65 on the wafer 
W to the measuring point of the alignment optical 
system 15. This movement time management table also 
includes movement prohibition information for 
indicating prohibited movement. This movement 
prohibition information is indicated by symbol "X" in 
the drawing. 

Fig. 11 is a drawing to show an arrangement of 
the chip areas on the wafer. Particularly, the chip 
areas indicated by hatching are those where the 
position measurement of the alignment marks specified 
in Fig. 9 is carried out. The size of each chip area 
is 22 mm (in the X-direction) x 22 mm (in the Y- 
directi^on). Distances between centers of chip areas 
are 22 mm in the X-direction and 22 mm in the Y- 
direction. 

Next, Fig. 12 shows cross alignment marks AM 
disposed in each chip area to be a measured object. In 
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this embodiment there are four alignment marks AMI to 
AM4 in each chip area as a measured object. When the 
center coordinates AO of the chip area are defined by 
(0^ 0)/ the position measurement of the alignment 
marks in the chip area is set to be carried out in the 
order of AMI (10, 10), AM2 (10, -10), AM 3 (-10, -10), 
and AM4 (-10^ 10). Of course, this is just an example, 
and the alignment marks AM 1 ' to AM4 ' can be 
positioned as shown in Fig. 13 on the chip area. 

The position measurement of the alignment marks 
is started after the wafer stage 10 has been moved so 
that the optic axis of the alignment optical system 15 
(or the measuring point on the wafer W by the 
alignment optical system 15) was aligned with the 
start point ST (X = 186.5, Y = 155.5). The position 
measurement is terminated at the time of arrival of 
the optic axis of the alignment optical system 15 (the 
measuring point on the wafer) at the end point EN (X = 
215, Y =133). 

Since the measurement time at each alignment 
mark AM is constant, the optimization of movement 
sequence means eliminating waste movement time from 
the movement sequence as much as possible. Let us 
consider a time from a point of completion of the 
position measurement at an arbitrary alignment mark AM 
(a start point of a unit movement sequence) to a point 
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of start of the measurement process at another 
arbitrary alignment mark AM (an end point of the unit 
movement sequence). In the case where there are n 
alignment marks AM, the number of cases of arbitrary 
5 unit movement sequences is at most n x (n - 1). Since 

the shortest movement times of the respective unit 
movement sequences are uniquely determined from the 
aforementioned preconditions, these are computed first 
and the movement time management table obtained (see 

10 Fig. 10) is stored in the memojry 62. After completion 

of the operation up to this point, the optimum 
movement sequence can be obtained by optimization of 
selection and order to determine which unit movement 
sequences should be selected out of the unit movement 

15 sequences and used in what order (in the sense of 

minimizing the final measurement process sequence of 
coordinates of alignment marks AM). Accordingly, to 
minimize the overall movement time of the movement 
sequence means to optimize the movement sequence for 

20 the position measurement of alignment marks AM, which 

is an object of the present invention. This method 
will be described below. 

This embodiment is arranged to automatically 
obtain the optimum solution of the order (movement 

25 sequence) of position measurement of alignment marks 

(a solution of a movement sequence to minimize the 
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overall movement time), using the genetic algorithm 
(GA: Genetic Algorithm) ^ which is a typical example of 
the evolutionary computation methods Fig. 14 and Fig. 
15 illustrate examples of the operation of the genetic 
5 algorithm. 

The name of the GA was given/ because individual 
operations thereof could be compared to genes, as will 
be described below. 

An initial solution in the GA is arbitrary. 

10 Accordingly, the genetic algorithm can be readily 

combined with another search method and it is also 
easy to incorporate only an advantage of another 
search method into a genetic operator. Therefore, 
proper analysis of problem and programmer's 

15 experiences in evolutionary computation are required 

to achieve an efficient approach with high degrees of 
freedom of design. 

The GA of this embodiment can be classified 
roughly into GA as a combinatorial optimization 

20 approach placing the main point on crossover, ES 

(Evolutional Strategy) as a continuous value 
optimization approach placing the main point on 
mutation, and, different in classification from these, 
GP (Genetic Programming) directed to the source and 

25 process procedures of program, but the essence of the 

all algorithms is identical. 
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In the GA, a group search is carried out with 
plural (N) agents called "genes," and the GA thus has 
such a property that even if a part of the group falls 
into a local optimum solution but if the other gene 
finds a better solution the search is led to the 
better solution. Since it is a multipoint search, it 
takes some time, but can efficiently search the 
optimum solution. A plurality of pairs (parents) are 
selected from the gene group consisting of the N genes 
and each of the gene pairs P bears children C 
resembling their parents. Further, some children C 
experience mutation in part of gene. Among these genes, 
descendents having genes with higher evaluation are" 
made to survive with higher probability in the next 
generation. Since the size of the group is normally 
fixed to N throughout generations except for special 
cases, descendents having genes with lower evaluation 
will gradually become extinct. Repetition of such 
alteration of generation will find plurail (N) genes 
with the optimum solution appearing in the group 
sooner or later. 

This embodiment adopts the Subtour Exchange 
Crossover (SXX) famous as an approach by GA in order 
to improve the movement sequence. In this way the 
optimization problem of movement sequence for the 
position measurement of alignment marks in the stepper 
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is formulated by the GA. Examples of models for the 
alteration of generation used in the GA include the 
MGG (Minimum Generation Gap) model in which the all 
genes are paired without any unmarried person and best 
two genes out of each family including a pair and 
children born are left to the next generation (H. 
Satoh, Yamamura and S. Kobayashi, Minimum 
Generation Gap Model for GAs Considering Both 
Exploration and Exploitation / Proceedings of 
IIZUKA '96, pp. 494-497), and the Elitist model in 
which good people out of the all parents and children 
are left preferentially (D. E. Goldberg, Genetic 
Algorithm in Search, Optimization and Machine Learning, 
Addison-Wesley Publishing Company Inc., 1989). The MGG 
model is used if the true optimum solution is desired 
to obtain finally even with some computation time; 
whereas the Elitist model is used if a good solution 
can be found in the early stage of the computation 
process and even if a computation time longer than 
that in the MGG model is necessary for obtaining the 
true optimum solution. Thus, they can be used 
selectively based on how to make the trade-off between 
the computation time and the quality of solution. In 
this embodiment, an example using the Elitist model 
will be described as an example of such selective use. 
Before describing the GA shown in the flowcharts 
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of Fig. 14 and Fig. 15 in detail/ each of operators 
shown in the algorithms will be described first. 

In Fig. 16 and Fig. 17 Pi, P2/ CI, C2, etc. 
represent measurement orders of alignment mark 
positions / imitating the genes. One unit is defined as 
a position of one measurement of an alignment mark. 
One gene is expressed by a string of n units 
corresponding to n alignment marks. Since in this 
embodiment the number of chip areas position-measured 
is 20 and the number of alignment marks in each chip 
area is 4, one gene is composed of 20 x 4 = 80 units. 
For simplification of description, the description 
will be given as to an example in which eight 
alignment marks are measured and denoted by A to H. 
Namely / each operator will be explained using an 
example in which one gene is composed of eight units. 

Fig. 16 is a drawing to illustrate a crossover 
operator for subtour exchange crossover , which is one 
of the genetic operators. The concept of the crossover 
operator is to make from two parents two or more 
children having characteristics of the both parents . 
The subtour exchange crossover operator is applied to 
a pair of genes Pi and P2 randomly selected out of 
plural genes. At this time the crossover operator 
searches portions having a common partial set of 
alignment marks in the genes Pi and P2. The third unit 
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to the sixth unit of the gene Pi are a set of 
alignment marks C, D, E, and F. Further , the fourth 
unit to the seventh unit of the gene P2 are also a set 
of alignment marks and F. 

5 These common portions (subtours) are exchanged 

between the gene PI and the gene P2 . Namely^ new genes 
of CI and C2 are generated. In this embodiment genes 
C3 and C4 are further generated by inversion of the 
partial information of the exchanged portions from the 
10 genes CI and C2 • Namely, the two parents have four 

children. 

Fig. 17 is a drawing to illustrate a mutation 
operator, which is one of the genetic operators. The 
concept of the mutation operator is to generate 

15 children having characteristics that their parents do 

not have. Fig. 17 illustrates the mutation operator, 
particularly, for improvement in the movement order of 
object (the order of position measurement of alignment 
marks: cyclic shift). The mutation operator is applied 

20 to a pair of genes P3 and P4 randomly selected out of 

plural genes. 

T)iis mutation operator first searches portions 
having a common partial set of alignment marks in the 
genes P3 and P4 . In Fig. 17, the second unit to the 

25 fifth unit of the gene P3 are a set of alignment marks 

B, C, D, and E, while the fourth unit to the seventh 
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unit of the gene P4 are a set of alignment marks C, 
D, and £• Thus, the two sets are identical. 

A difference of units between such partial 
strings appearing in the genes P3 and P4 is defined as 
5 a number of shifts. In Fig. 17 the partial string 

starts from the second unit in the gene P3 while the 
partial string starts from the fourth unit in the gene 
P4; therefore, there is the difference of two units 
and thus the number of shifts is 2. Then the units 

10 appearing in the partial set of alignment marks are 

shifted in order and in cycle by the number of shifts, 
thus generating new genes C5 and C6. This means that 
the gene C5 is obtained in such a way that the partial 
set of BODE in the gene P3 is shifted once to be CDEB 

15 and then it is shifted once more to be DEBC. The gene 

C6 is also obtained in the same way. 

The contents of each operator were detailed 
above and now, returning to the flowcharts of Fig. 14 
and Fig. 15, the procedures of application of each 

20 operator will be described. 

First, genes of a gene group consisting of genes 
numbering N preliminarily set (for example, N = 80) 
are generated randomly (step ST12). Each gene in this 
group is determined as one for measuring each of the 

25 all positions of the alignment marks numbering n (for 

example, n = 80) once. Each of the genes can be a 
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solution generated (or updated) by an approach based 
on the rule of thumb, an approach based on the linear 
programming method (for example, the nearest neighbor 
method which will be called the NN method and the 
details of which will be described hereinafter, or the 
like), or the Lin and Kernighan's approach 
(hereinafter referred to as the LK method the details 
of which will be described hereinafter) with an 
initial solution being a constraint satisfying 
solution generated arbitrarily; or a solution 
generated (or updated) by the LK method with an 
initial solution being a constraint satisfying 
solution generated by the approach based on the linear 
programming method (for example, the NN method or the 
like). Since the generation t at this time is t = 1, 
the gene group of the first generation will be called 
Gl. The overall movement times of movement sequences 
are computed for the position measurement of alignment 
marks of individual genes in the gene group Gl of the 
first generation. For example, the overall movement 
times are 64.278 sec for the first gene, 63.448 sec 
for the. second gene,..., 56.163 sec for the twenty fifth 
gene, and 53.04 6 sec for the twenty sixth gene. 

In one alteration of generation, first, the 
crossover is carried out in the number of crossovers 
according to a crossover rate preliminarily set (Pc = 
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0.4): N X PC times (80 x 0.4 = 32 times) out of the 
group of the N (= 80) genes. A pair of (two) genes as 
objects of one crossover are sampled without 
replacement (or without permitting crossover of a same 
gene) preferentially from the shortest measurement 
time out of the gene group. This selection of the 
paired genes is carried out the number of crossovers 
times by sampling with replacement (permitting one 
gene to be selected plural times for different pairs) 
out of the gene group. One of specific, applicable 
methods for preferentially selecting the genes of 
short measurement time is the roulette wheel selection, 
which will be described in step ST36. Another 
applicable method is a method for selecting N x Pc x 2 
genes in order from the shortest overall movement time 
of movement sequence out of the gene group and 
randomly selecting pairs of N x Pc genes therefrom. 

When the crossover operator is applied to each 
of these gene pairs (step ST16), new genes numbering N 
xPcx2 (80x0.4x2=64) are generated . This 
operator produces a group of new-born genes A (= N x 
PC X 2). (step ST18) . 

Next, the "mutation of ordinal shift of position 
measurement of alignment marks" is carried out A x Pm 
times (64 x 0.4 = 25.6 times: which is raised to an 
integer, 26) according to the rate of "mutation of 
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ordinal shift of position measurement of alignment 
marks" preliminarily set (Pm = 0,4) out of these new- 
born genes. Selection of one gene as an object of one 
"mutation of ordinal shift of position measurement of 
alignment marks" is carried out by random sampling 
with replacement (permitting one gene to be selected 
as a mutation object plural times) out of the gene 
group (step ST20: selection 2). When the "mutation 
operator of ordinal shift of position measurement of 
alignment marks" is applied to the totally twenty six 
genes selected (step ST22)^ new genes numbering A x Pm 
(twenty six genes) are generated. This operator 
produces a group of new-born genes B (= A x Pm = N x 
PC X 2 X Pm) (step ST24). 

By the process heretofore, there are the gene 
group Gl of the first generation (N = 80), the new- 
born gene group A (64), and the new-born gene group B 
(26). Totally^ there are the genes numbering N x {1 + 
2 X PC X (1 + Pm)> (170 genes) (step ST26). In the 
case of the GA in the flowchart shown in Fig. 14, the 
steps up to this point correspond to multiplication of 
new-born genes by the genetic operators. In the case 
of the GA in the flowchart shown in Fig. 15, 
multiplication by the LK operator detailed hereinafter 
is introduced immediately after this point. 

After completion of multiplication of new-born 
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genes in each generation (i.e,^ in one loop in the 
flowchart shown in Pig. 14 or Fig. 15), in the final 
step of the loop a survival process is carried out to 
select genes to be left as a gene group of the next 
generation (i.e., an initial gene group in the next 
loop) and to dismiss the other genes (step ST36). 
Namely, the movement times are checked for the all 
movement sequences corresponding to the individual 
genes included in the new-born gene group A and the 
new-born gene group B generated by the genetic 
operators (the new-born gene group C is further added 
in the case of the GA in the flowchart shown in Fig. 
15) in addition to the initial gene group Gt. Then the 
best gene, which is a gene to achieve a movement 
sequence having the shortest movement time among them, 
is outputted. At this time, if S different genes all 
are the best, these S genes are left unconditionally 
in the next generation. Further, (N - S) genes are 
also selected out of the remaining (170 - S) genes by 
such a selecting method as to select a gene 
corresponding to a movement sequence of a shorter 
movement time more preferentially. These genes are 
also left in the next generation (i.e., in the (t + 
l)-th generation). Namely, the survival process is 
carried out so as to select totally N genes including 
those of the movement sequences of the shortest 
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movement time and the other genes (step ST36), This 
selection method is not one for selecting N movement 
sequences from the shortest movement time^ but is one 
for selecting epistatic movement sequences with a 
5 higher priority but still leaving a possibility of 

selecting even a hypostatic movement sequence (of 
longer movement time), though low. The reason why the 
possibility of selecting a hypostatic movement 
sequence is left is that the true optimum solution is 

10 not always derived from a group of epistatic movement 

sequences, but is also possibly derived from a group 
of hypostatic movement sequences. Meanwhile, if a 
solution derived from a hypostatic movement sequence 
is also still hypostatic in the next generation, it 

15 will be naturally dismissed. Therefore, it will not be 

a hindrance against finding of the true optimum 
solution. The roulette wheel selection can be applied 
as a selection method of this type. For excimple, 
supposing in the optimization problem of movement 

20 sequence there are three candidates X, Y, Z as 

solutions of movement sequences and their movement 
times are 10 sec, 20 sec, and 20 sec, respectively, it 
is contemplated that the inverse of the time is set as 
a fitness value of each solution. At this time, 

25 supposing one solution is selected out of these three 

solutions by roulette wheel selection, probabilities 
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of selection of the solutions and Z are 0.5^ 

0.25^ and 0.25/ respectively. The eighty new genes 
thus selected compose a set of genes of the next 
generation, Gt+1. Then, returning to step ST14/ the 
5 same process with the genetic operators is repeated. 

As apparent from the algorithm to repeat this 
alteration of generation, for example, supposing 
exchange of reticle takes 10 sec and within that 
period the computation is completed, for example, up 

10 to the fortieth generation, the shortest movement 

sequence for position measurement of alignment marks 
can be obtained among the group of genes (solutions) 
of the fortieth generation. Supposing exchange of 
reticle takes 11 sec and within that period the 

15 computation is completed up to the forty first 

generation, the shortest movement sequence for 
position measurement of alignment marks can be 
obtained among the group of genes (solutions) of the 
forty first generation. 

20 Further, as shown in Fig. 15, the LK method can 

be incorporated as one of the genetic operators into 
the GA in order to obtain the optimum solution or the 
best solution more efficiently. In this case, the LK 
method (hereinafter referred to as an LK operator) is 

25 applied to plural genes (for example, to the all 

genes) in the gene group including the new-born genes 
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obtained after the application of the crossover 
operator and mutation operator (step ST30). Thus, a 
new-born gene group C (the number of new-born genes 
included in C is N x 2 x Pc x (1 + Pm)) is newly 
produced herein (step ST32). Namely, immediately 
before the survival, there exist the genes numbering 2 
xNx2xPcx(l+ Pm) (Step ST34). The preset 
crossover rate Pc, the preset mutation rate Pm, the GA, 
etc. are stored in the memory 62 of the main control 
system 6 (Fig, 6), 

Since the above embodiment minimizes the overall 
movement time of the movement sequence for position 
measurement of alignment marks by applying the GA to 
the projection exposure apparatus, it makes possible 
scheduling of the order (path) of position measurement 
of each alignment mark. 

The present embodiment also makes possible the 
optimization in which the local search by the 
crossover operator is merged with the global search by 
the mutation operator. In addition, since the best 
solution of each generation is presented every 
alteratp.on of generation by the GA, a near-optimum 
solution consistent with a computation time can be 
outputted even with interruption of computation within 
the allowed computation time. 

The crossover operator, SXX, in the present 
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embodiment can be replaced by another crossover 
operator used in approaches to TSP^ such as OX: Order 
Crossover (L. Davis / "Applying Adaptive Algorithms to 
Epistatic Domains," Proceedings of International Joint 
5 Conference on Artificial Intelligence (IJCAI), 1985), 

PMX: Partially Mapped Crossover (D. E. Goldberg and R. 
Linge, "Alleles, Loci, and the Traveling Salesman 
Problem, " Proceedings of International Conference on 
Genetic Algorithms (ICGA), 1985), or EAX: Edge 

10 Assembly Crossover (Nagata, Ono, and Kobayashi: 

"Proposal and evaluation of edge assembly crossover: 
New crossover of TSP considering tradeoff between 
character heredity and degree of freedom of 
crossover," System and Information Joint Symposium '96, 

15 Keisoku Jido Seigyo Gakkai, 1996 )• 

Next described is the difference between the 
determining method of movement sequence according to 
the present invention and the conventional determining 
method of movement sequence. Specifically, for clearly 

20 expressing the difference between the solution of 

movement sequence obtained by the present invention 
(the result of execution of the determining method 
according to the present invention) and the solution 
of movement sequence obtained by the conventional 

25 technology (the result of execution of the 

conventional determining method), the following 
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description shows the difference of measurement path 
in the serial movement sequence, i.e., the difference 
in a path of a locus drawn by measuring light on one 
wafer (a locus of the point where the measuring light 
arrives), and the difference in the overall movement 
time of movement sequence necessary for measuring 
positions of alignment marks on one wafer. 

For guaranteeing the true optimum solution, all 
constraint satisfying solutions (feasible basic 
solutions) that can be generated must be checked and 
it requires enormous computational complexity, which 
is not practical. For examination, alterations of 
generation, which seemed sufficient to converge, were 
made in the GA using the MGG model under such preset 
conditions that the group size was somewhat large, the 
crossover rate was somewhat high, and the mutation 
rate was sufficiently high in comparison with the 
group size. When the alterations of generation were 
made with a sufficient time from an initial group 
generated at random, the convergent solution (the true 
optimum solution) was the movement sequence as shown 
in Fig.. 18. The route passed the nearest alignment 
marks after or before the start point ST and the end 
point EN and there existed no seemingly long path. The 
overall movement time was 49.052 sec. After that, the 
convergent solution this time was handled as a 
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substitute for the "true optimum solution," In this GA 
the computation was continued up to the 500th 
generation (for about thirty hours) under the setting 
conditions that the group size N was N = 160^ the 
crossover rate Pc was Pc = 0.5, and the mutation rate 
Pm was Pm = ©•4. A solution regarded as the convergent 
solution first appeared near the 300th generation and 
approximately twenty hours were consximed up thereto. 
This GA can finally reach the optimum solution with 
the sufficient time consumed, but it is not practical 
to apply the GA as it is. Thus, the approaches by the 
GAs shown in Fig. 14 and Fig, 15 above involve some 
means to find the optimum solution quickly. 

Fig. 1 and Fig, 2 show solutions of movement 
sequences for the position measurement of alignment 
marks being measurement targets, which were obtained 
by the prior-art determining method described 
previously. These movement sequences obtained by the 
prior-art approach are utilized from an empirical hope 
that they might not be the optimum solution, but may 
be satisfactorily good solutions. In the following 
description this approach is thus called "approach 
based on the rule of thumb." 

When the approach based on the rule of thumb is 
applied, the operator determines the movement sequence 
that seems empirically good, according to the 
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arrangement of chip areas to be measured. For example, 
a movement sequence obtained by the approach based on 
the rule of thumb is as follows- First, when the chip 
area 64 closest to the start point ST is selected as a 
5 first chip area to be measured, the measurement is 

carried out clockwise from the alignment mark AMI (see 
Fig. 12) of this chip area via AM2 and AM3 to AM4 in 
order (of course, the measurement can be carried out 
counterclockwise in the order of AMI, AM4, AM3 , and 

10 AM2). After completion of the position measurement of 

the all alignment marks given to the chip area 64, the 
measurement is continued to measure the positions of 
the all alignment marks of the chip area 63. 
Subsequently, the positions of the all alignment marks 

15 of the chip area 62 are measured in the same manner 

and then the positions of the all alignment marks in 
the all chip areas are successively measured. 

With the approach based- on the rule of thumb 
used herein, the optimization of movement sequence for 

20 the position measurement of alignment marks was not 

effected taking account of such a route that before 
completion of the position measurement of the all 
alignment marks of one chip area, the measuring point 
visited an alignment mark of another chip area and 

25 thereafter again visited the remaining alignment marks 

of the first chip area. Therefore, the approach based 


65 


NIK97-13US 


on the rule of thumb takes a short time for 
computation/ but it fails to effect the optimization 
taking full account of the various conditions of the 
positions of the respective alignment marks, except 
for restriction conditions such as the performance of 
the stage or the chip areas. 

The solution obtained by the approach based on 
the rule of thumb was as shown in Pig. 1 where the 
measurement was conducted clockwise for the positions 
of the alignment marks in each chip area. The overall 
movement time of the resultant movement sequence (from 
the start point ST to the end point EN) was 53.149 sec. 
When the measurement was conducted counterclockwise 
for the positions of the alignment marks in each chip 
area, the overall movement time of the resultant 
movement sequence (from the start point ST to the end 
point EN) was 52.757 sec. In contrast, the overall 
movement time of the resultant movement sequence ^ - 
according to the optimum solution obtained by the GA 
of the MGG model described above was 49.052 sec. It is 
thus seen that the overall movement time can be 
shortened by about 3 sec from that of the movement 
sequence obtained by the approach based on the rule of 
thumb. 

Linear P rogramming Methods Nearest Neighbor Method (NN 
method ) 
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In general, even if a solution of local partial 
movement sequences is optimum, a movement sequence 
having such partial movement sequences will not be 
always an optimum solution. When this problem is 
5 solved by the linear programming method to obtain a 

solution, the resultant solution is not always the 
(globally) optimum solution. However, since the 
determining method of movement sequence according to 
the present invention is a method for generating a 

10 good solution, or a locally optimum solution very 

quickly, it is used as an effective near-optimum 
obtaining method where the point is to obtain a 
solution in a short time. 

The nearest neighbor method (the NN method), 

15 which is one of most popular search methods, will be 

described herein as an embodiment by the linear 
programming method. The NN method is a method for 
repeating such an operation as to arbitrarily select 
one starting point and choose a nearest neighbor point 

20 thereto as a next point. 

A point selected once will be excluded in order 
from a .candidate group of next selection. In the 
search process of the best unit movement sequence, 
when there are plural candidates for the unit movement 

25 sequence selected next (or when there are plural 

candidates having an identical and shortest movement 
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time of unit movement sequence for movement to the 
start point of the next position measurement of 
alignment mark) and if the all cases are searched, the 
search could be a search of all solutions in the worst 
5 case. For the purpose of decreasing the computation 

time by avoiding it, an idea to randomly select 
several points out of the next candidate points at 
each point (for example, in such a way as to 
preliminarily assume that p points are selected in the 

10 highest case and arbitrarily determine how many points 

should be selected between 1 and p, at each point) can 
be introduced. 

In this embodiment the above number p is set to 
1 and a solution is sought by the NN method while a 

15 search start position is selected from the end point 

EN and the all alignment marks. The reason is that 
when the above number p is 1 and if the search start 
position is fixed at the start point ST, we can find 
merely one solution. In this embodiment, however, the 

20 measurement start position is set to be the start 

point ST and the measurement end position to be the 
end point EN. In order to satisfy this constraint, the 
start point ST must follow the end point EN. Therefore, 
after the solution is obtained by the NN method, the 

25 units of the gene need to be shifted so as to start 

from the start point ST and end at the end point EN. 
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For quickly carrying out the algorithm of this 
type, it is preferable to make a movement time 
management table of movement patterns preliminarily 
contemplated and to write unit movement times of the 
respective patterns in the memory or the like. With a 
list of candidates for movement from a certain point 
to a next point, it is effective to sort the 
candidates in the order of unit movement times from 
the smallest. This operation makes it possible to 
select points not having passed yet one by one from 
the top of the candidate list. The same table of 
movement patterns is also utilized for increasing the 
efficiency of calculation of movement time similarly 
in the LK method and the evolutionary computation 
method . 

By employing such a method that every time a new 
solution is generated by the NN method, the best 
solution at that point is recorded in the memory 62 of 
the main control system 6, the best solution by the NN 
method at that time can be obtained even with 
interruption of computation on the way. For example, 
the operation in this embodiment is carried out 
according to the flowchart shown in Fig. 20. Eighty 
one solutions were generated by the NN method and the 
computation time for generation of the eighty one 
solutions was about 0.03 sec. As a result, the best 
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solution by the NN method in this embodiment was as 
shown in Fig. 19, and the overall movement time of the 
movement sequence for the position measurement of the 
alignment marks (from the start point ST to the end 
point EN) was 49.614 sec. 

Lin and Kernlghan's approach (the I.K method) 

The LK method is famous as a quick near-optimum 
obtaining method for symmetric TSP (Traveling Salesman 
Problem), i.e., as a technique capable of obtaining a 
near-optimum solution in a very short turnaround time 
of computation and is a technique of development of 
the k-OPT method, which is the general name of the 2- 
OPT method and the 3-OPT method. Details of these 
techniques will be described hereinafter. The TSP is a 
problem: given an arbitrary city matrix for n (n > 1) 
cities, find a minimum- length tour that visits each 
city only once and the all cities . There are symmetric 
and asymmetric TSPs. The symmetric TSP is a TSP - for an 
object in which the overall pathlength is invariant 
even with inversion of the tour order from the 
sequential city visit order given for the matrix of n 
cities. Conversely, the asymmetric TSP is a TSP for an 
object in which the overall pathlength will vary with 
inversion of the tour order from the above city visit 
order . 

The TSP was originally the problem considered in 


70 


NIK97-13US 


order to minimize the length of the tour, for example, 
assuming that a matrix of cities are located randomly 
on a two-dimensional plane or the like. In such a city 
matrix space the shortest path between two arbitrary 
cities (i.e., an arbitrary unit path) is given 
uniquely by connecting the two cities by a straight 
line; therefore, once the visit order of the cities is 
determined, the minimum-length tour path according to 
the visit order can be obtained uniquely. Accordingly, 
finding a city visit order (city permutation) to 
minimize the length of the path to visit the all 
cities is equivalent to obtaining a solution to the 
TSP. Therefore, even if the objective problem is not 
directed to minimization of the path itself (i.e., a 
distance in a physical space) and if it is converted 
to one equivalent to the optimization of permutation, 
the approach of TSP can be applied as it is. For 
example, the technique of TSP can be applied as it is, 
to the problem to determine the movement sequence so 
as to minimize the overall movement time for the 
position measurement of the all alignment marks in the 
one-shot exposure type stepper. The movement times of 
unit movement sequences in this case are not distances 
between two arbitrary alignment marks, but they are 
the times as the memory contents in the movement time 
management table stored in the memory 62 (because the 
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movement along the X-direction and the movement along 
the y-direction of the wafer stage can be made 
independently and simultaneously). Let us consider 
below application of the "LK method" which can be 
5 regarded as an improvement of the k-OPT method. 

The NN method is a "generating method" to 
generate a solution from nothing ^ while the k-OPT 
method and LK method are so-called "improving methods" 
for initially giving a certain initial solution (here, 

10 in the case of the "constraint satisfying problem" to 

require an output solution to satisfy a specific 
constraint, a necessary condition is that the initial 
solution is a feasible basic solution) and 
successively improving the solution. Particularly, the 

15 LK method is a method for repetitively performing such 

an operation as to extract a part of a tour sequence 
of the initial solution and to invert the partial 
order, thereby effecting repetitive improvements even 
in a solution after improved, as long as an 

20 improvement is possible. 

In one improvement, if the sum of lengths of 
paths (two cut paths) at the both ends of the partial 
sequence extracted is smaller than that before the 
inversion, the length of the entire path will be 

25 shortened by the difference between them. Therefore, 

this method is efficient, because attention is focused 
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only on the lengths of the two cut paths and 
computation is unnecessary for the total pathlength. 
This single jLmprovement is called 2-OPT. After 
accomplishment of 2-OPT, when 2-OPT is again carried 
out in a combination of one end of the partial path 
extracted with another end this time, three ends are 
changed as a whole. This is called 3 -OPT. These are 
generally called k-OPT. In the k-OPT the value of k is 
preliminarily given. Therefore, the k-OPT had a 
problem that it was not sufficiently adaptable to the 
case wherein the optimum k value varied depending upon 
the initial solution to be improved. However , the LK 
method updates k in k-OPT ( from 2 ) one by one as long 
as an improvement is made. Thus the LK method overcame 
the problem in the k-OPT method. 

If the method is so arranged that every 
generation of a solution by the LK method the best 
solution at that point is stored in the memory 62 of 
the main control system 6, the best solution by the LK 
method at that time can be obtained even with 
interruption of computation on the way. For example, 
the operation of this embodiment is carried out 
according to the flowchart shown in Fig. 22. 

In this embodiment the LK method is applied to 
near-optimum solutions obtained by the NN method. The 
LK method generates one improved solution per initial 
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solution. Since one improved solution is generated per 
initial solution, the number of initial solutions 
should be determined preferably as high as possible. 
This embodiment employs an approach method for using 
eighty one solutions obtained by the NN method as 
initial solutions and improving them. 

The computation time was about 0.14 sec for 
obtaining eighty one solutions by applying the LK 
method to the all eighty one solutions resulting from 
the forward search of the NN method. Since the 
computation time becomes longer with more improved 
portions by the LK method, it is not simply 
proportional to the number of initial solutions to be 
improved. The result of this experiment showed that 
when the solutions by the NN method were used as 
initial solutions of the LK method, the best solution 
was the solution of the movement sequence shown in Fig. 
21 and the overall movement time of ^the movement 
sequence was 49.117 sec. 

Approach based on the evolutionary computation method 

Next described is the approach based on the 
evolutionary computation method according to the 
present invention. An initial group is first . generated. 
Since the shortest movement times of the above unit 
movement sequences are already recorded in the two- 
dimensional table (see Fig. 10), for example, if a 
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movement sequence is arbitrarily produced so as to 
pass each alignment mark only once without duplication 
with reference to this table, a gene indicating an 
arbitrary feasible basic solution can be generated. 
5 Using plural genes (each of which is a feasible basic 

solution) arbitrarily generated as described/ as an 
initial group, preliminary experiments were 
repetitively conducted with different group sizes, 
crossover rates, and mutation rates. The preliminary 

10 experiments verified that the same solution as the 

optimum solution, which was obtained by 20-hour 
computation by the GA of the MGG model described above, 
was generated within about 20 seconds under such 
setting conditions that the group size N = the number 

15 of alignment marks, the crossover rate Pc = 0.4, and 

the shift mutation rate Pm = 0.4 and that the LK 
operator was applied to the all genes . 

Further, experiments were conducted using the 
solutions obtained by the above LK method as initial 

20 solutions. It is highly possible that a solution 

obtained by the NN method + the LK method (i.e., by 
the LK method using the solutions obtained by the NN 
method, as initial solutions) is a local solution in 
the optimization problem of measurement process 

25 sequence of coordinates of the alignment marks (which 

is a locally optimum solution, but is not a true 
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optimum solution or a globally optimum solution) . This 
thus raises a risk of biasing the solution search 
space in the global search by the GA (i.e., a risk of 
search biased to near the local solution). In order to 
avoid it/ the initial group should desirably include a 
solution of good quality representing a completely 
different movement sequence / as well as the best 
solution by the NN method + the LK method. If the 
added solution ^ though representing the completely 
different movement sequence ^ is not of good quality ^ 
it will be dismissed to be extinct soon in the GA. 
Therefore, this example employed a method for sampling 
solutions without replacement by the number equivalent 
to the group size (N) at predetermined probabilities/ 
each at a probability consistent with an evaluation 
value of each solution, out of the solutions obtained 
by the NN method and the solutions obtained by 
applying the LK method to the all solutions obtained 
by the NN method, in addition to the best solution 
obtained by the NN method + the LK method (the 
roulette wheel selection) . This group of solutions was 
used as an initial gene group in the GA. 

With the above group being used as an initial 
gene group and under such setting conditions that the 
group size N = the number of alignment marks, the 
crossover rate Pc = 0.4, and the shift mutation rate 
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Pm = 0.4 and that the LK operator was applied to the 
all genes, the GA stably generated a solution superior 
to that by the LK method within 2 sec and also 
generated the same solution as the optimum solution 
obtained after 20-hour computation by the GA of the 
MGG model, within 10 sec. 

Fig. 23 to Fig. 25 are drawings to show 
relations among the various approaches described above, 
computation times necessary for such approaches , and 
overall movement times of movement sequences obtained 
by the approaches . The ordinate indicates the overall 
movement time of movement sequence obtained and the 
abscissa the computation time. The units are sec. 

Fig. 23 is a drawing to illustrate the overall 
movement time of the movement sequence of the solution 
obtained by the approach based on the rule of thumb, 
the time necessairy for obtaining the best solution by 
the NN method and-the overall movement time of the 
movement sequence being the solution that time, and 
the computation time necessary for obtaining the best 
solution by the LK method using the solution set by 
the NN method as initial solutions and the overall 
movement time of the movement sequence being the 
solution that time. 

Fig. 24 is a drawing to show the result achieved 
when the GA was applied to an initial group as being a 
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set of solutions consisting of the feasible basic 
solutions randomly generated as described above. 
Further/ Fig, 25 is a drawing to show the result 
achieved when the GA was applied to an initial group 
5 as being a set of solutions consisting of the 

solutions obtained by the NN method + the LK method. 
It is apparent from either figure that the best 
solution is updated with progress in alteration of 
generation. For comparison's sake. Fig. 24 and Fig. 25 

10 also show the overall movement time of the movement 

sequence of the solution obtained by the approach 
based on the rule of thumb and the overall movement 
time of the movement sequence of the best solution 
obtained by the LK method using the solution set by 

15 the NN method as an initial solution group. The best 

solution obtained by the GA shown in Fig. 24 was 
obtained in the computation time of 6.18 sec and the 
overall movement time of the movement sequence being 
the best solution obtained was 49.052 sec- Similarly, 

20 the best solution obtained by the GA shown in Fig. 25 

was obtained in the computation time of 5.00 sec and 
the overall movement time of the movement sequence 
obtained was 49.052 sec. 

The above experiments were conducted on a 

25 workstation provided with the CPU of 200 MHz. It 

should be noted that the above computation times 
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naturally differ depending upon the performance of the 
CPU or the like. 

Next described referring to Fig. 26 to Fig. 49 
are embodiments of the designing method and apparatus 
5 of optical system according to the present invention, 

and the recording medium in which the program for 
realizing the designing method is recorded. In the 
drawings the same portions will be denoted by the same 
reference symbols and the description thereof will be 

10 omitted . 

The designing method of optical system according 
to the present invention can also be realized by a 
program described in predetermined language. This 
program is executed, for example, by an arithmetic 

15 circuit having memories (RAM, ROM) or by a computer 1 

as shown in Fig. 26. Particularly, the computer 1 
shown in Fig. 26 has a main control system 100 
composed of control section 101, arithmetic section 
102, and storage section 103 (memory) and also has 

20 peripheral devices, including input section 104, 

output section 105, and external storage device 106 
such as a hard disk. Examples of the output section 
105 include display devices such as CRT 115 or a 
liquid-crystal display. When the designing method of 

25 optical system according to the present invention is 

executed by the computer 1 as shown in Fig. 26, it is 
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preferred to record the program for realizing the 
designing method of optical system in the above 
external storage device 106 or to optically or 
magnetically record the program in a predetermined 
recording medium such as CD 115^ MO, FD, a magnetic 
tape, or ROM. 

For easier understanding / definitions of terms 
used in this specification will be described below. 

Individual: autonomous individual featured by 

genes ; 

Population: set of individuals; 

Population size: number of individuals allowed 
to exist in a population; 

Gene: basic component representing genetic 
information ; 

Allele: value that each gene can take; 

Chromosome: genes expressed in the form of a 
string (a string of characters), a vector (a string of 
numerals), or a mixture thereof; 

Locus: position of a gene on a chromosome; 

Phenotype: external expression of a character 
revealed by a chromosome; 

Genotype: gene expression (internal expression) 
for defining a character; 

Coding: mapping from phenotype to genotype; 

Decoding: inverse mapping from genotype to 
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phenotype ; 

Fitness value: scalar evaluation value which 
indicates a degree of fitness of an individual to an 
environment (in the case where the object is the 
optimization problem, a fitness value is a value of an 
objective function itself or a value of a penalty 
function taking account of constraints); 

Genetic operations: operations of crossover, 
mutation, and selection; 

Selection: selection for survival of individuals 
according to their fitness values (to make a pair of 
individuals for mating, taking account of difference 
in fitness value); 

Crossover: recombination of genes between 
individuals (to make new individuals by mutual 
recombination of chromosomes of a pair of individuals 
between them) ; 

Mutation:" replacement with an allele (to replace 
a value of a certain locus with another allele); 

Generation: one cycle of genetic operations; 

Diversity: degree of preserving diversity of 

gene; 

Parallel algorithms: model in a distributed 
population; 

Migration: exchange of individuals between 
plural populations . 
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In the following description we will consider 
two evaluation criteria and first check the global 
search capability of the designing method where the 
designing method of optical system according to the 
present invention is applied to design of a lens 
system of three lenses and a lens system of four 
lenses while the two evaluation criteria are converted 
to a single objective (the first embodiment). Next, we 
will handle the two evaluation criteria as multiple 
objectives to show excellent multi-objective 
optimization capability of the designing method of 
optical system according to the present invention as 
to design of a lens system of three lenses (the second 
embodiment) . 

In the embodiments of the designing method of 
optical system according to the present invention, 
supposing the user sets the focal length t, brightness 
F, and field angle 2w as design specif ications of- lens 
system, let us consider optimization of radii of 
curvature ri of respective lens surfaces (boundary 
surfaces of optical elements composing the optical 
system)./ thicknesses of lenses, and distances di 
between the lenses. It is assumed that the refractive 
indices of glasses and the number N of lenses are 
preliminarily given. A radius of curvature of a lens 
surface located closest to the film surface (the image 
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20 


plane) and a distance from the lens surface to the 
film surface will be corrected so as to satisfy the 
focal length given by tracing of paraxial rays. 
Accordingly r design of a lens system of N lenses is 
equivalent to optimization of a (4N - 2 ) -dimensional 
function. 

A lens system designed is evaluated based on 
three spot diagrams produced by letting totally eleven 
rays around the principal ray pass through the lens 
system at three angles of incidence of 0**, 0.65w**, and- 
w**. The two evaluation criteria are distortion (D) and 
resolution (R) . The distortion D is given as a 
distance between an ideal image position (f -tanw) and 
an image point of the principal ray. The resolution R 
is given as a standard deviation to indicate a degree 
of dispersion of image points of the other ten rays 
than the principal ray from the image point of the 
principal ray, as shown in the following Equation ^ 2^^^ 
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In the equations, (xideal, yideal) represents the 
ideal image position, (xO, yO) the position of the 
image point of the principal ray, and (xk, yk) 
positions of the respective image points of the other 
5 ten rays. 

Next, the genetic algorithm (GA) will be 
explained. 

The GA is an engineering model imitating the 
evolutionary process of organism, and a general 
10 algorithm thereof is formed as follows. Specifically, 

it is composed of the following operations. 

(1) Generation of Initial Population 

A plurality of individuals (solution candidates) 
are generated and are used as an initial population. 
15 (2) Selection for Reproduction 

Individuals to be parents (parent individuals) 
are selected from the population. 

(3) Generation of Children 

Child individuals (new solution candidates) are 
20 generated by applying the crossover operator and/or 

the mutation operator to the parent individuals. 
(.4) Selection for Survival 

Individuals to form a next-generation population 
are selected from the population of the individuals in 
25 the current generation population and the child 

individuals generated in above step (3). 
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(5) The above steps (2) to (4) are repeated 
before a predetermined end condition is satisfied. 

Design items of the GA are classified roughly 
into design of coding/crossover-mutation and design of 
5 generation model. The design of coding/crossover- 

mutation involves the design of a representation 
method for representing individuals and the design of 
a generating method for generating new individuals, 
and is dependent on a problem region. Since the 
10 performance of the GA is greatly affected by the 

design of coding/crossover-mutation/ it is a very 
important design item. 

On the other hand, the design of generation 
alteration model involves the design of a selecting 
15 method for selecting parents to generate children and 

the design of a selecting method for selecting 
individuals to be left in the next generation 
population. By well managing this design, it becomes 
possible to search at a time a set of solutions in 
20 plural tradeoff relations in the multi-objective 

optimization . 

TJie design of coding/crossover will be described 

below . 

In the embodiments of the designing method of 
25 optical system according to the present invention, 

real vector representation is employed as coding. An 
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individual is expressed by a real vector (rl, r2,..«, 
rn^ dl/ d2,..., dn) components of which are radii of 
curvature and distances between surfaces (parameters 
featuring each candidate of optical system to be 
designed). 

[n this embodiment UNDX (Ono^ I. and Kobayashi^ 
S: A Read-coded Genetic Algorithm for Function 
Optimization Using Unimodal Normal Distribution 
Crossover/ Broceeding of 7th International Conference 
10 on Genetic Algorithms, pp. 246-253 (1997)) is adopted 

as a crossover Operator. The UNDX generates , from two 
parents of Parent\l and Parent 2 out of selected 
parents / two children according to a normal 
distribution set around them, as shown in Fig. 27. The 
15 standard deviation of tthe normal distribution is set 

so that a component ol a\ong the major-axis direction 
connecting the both parents is proportional to a 
distance between the parentsV ( al = adl where dl: the 
distance between Parent 1 and^arent 2) and so that a 
20 component a2 along the other axis is proportional to a 

distance between the major axis abd Parent 3 (a 2 = 
pd3, where d3: the distance betwee^ Parent 3 and the 
axis connecting Parent 1 with ParenA2). Fig, 2 7 
illustrates an example of two variables. 
25 This crossover operator is effective to 

functions wherein strong dependence exists between 
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variables, and multimodal functions. This is 
conceivably because the UNDX undergoes the global 
search in the early stage of search where distances 
are large between parents and undergoes the local 
search in the final stage where the distances are 
close between parents and it can accomplish the search 
little dependent upon the coordinate system. 
First embodiment by the designing^ method of optical 
system 

In the designing method of optical system of the 
first embodiment according to the present invention , 
the effectiveness concerning the global search 
capability of the GA will be verified by experiments 
with scalar evaluation values being set by linear 
combination under a given tradeoff ratio of resolution 
R and distortion D. 

Used as a generation alteration model of the GA 
in Embodiment 1 is the MGG model shown in. Fig. . 28 „ . 
(Satoh/ H., Yamamura, M. and Kobayashi, S.: Minimal 
Generation Gap Model for GAs Considering Both 
Exploration and Exploitation, Proceedings of IIZUKA 
'96/ PP.* 494-497 (1996)). In this generation 
alteration model , n crossovers are carried out each 
with two individuals randomly selected as parents from 
a population. Then alteration of generation is done by 
returning one elitist individual and one individual 
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selected by roulette selection out of the parent and 
child individuals into the population. 

For verifying the effectiveness of the designing 
method in this first embodiment, optimization was done 
for the lens system of three lenses and the lens 
system of four lenses. For the lens system of four 
lenses, three experiments were conducted with 
different design specifications. In the all 
experiments, the number of individuals in the 
population was 100, the number of crossovers n was 100, 
a of UNDX was 0.5, and p thereof was 0.35. The 
tradeoff ratio of distortion D and resolution R was 
1:1. 

Experiment 1 

The optimization of the lens system of three 
lenses was conducted as Experiment 1 of the first 
embodiment. The design specifications were as follows; 
the focal length f = 100 mm, the brightness F = 3.0, .„ 
and the field angle 2w = 38.0**. This experiment was 
started from the initial population consisting of 
random lenses LI to L3 as shown in Fig. 29 and Fig. 30 
and was ended in the stage after evaluation of four 
million lenses. In these Fig. 29 and Fig. 30, S 
represents a stop and I the image plane. A lens 
configuration and spot diagrams of the resultant 
optical system are shown in Fig. 31 and an aberration 
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diagram thereof in Fig. 32. The spot diagrams are 
indicated in the range of ±0.5 mm. 
Experiment 2 

The optimization of the lens system of four 
lenses was conducted as Experiment 2 of the first 
embodiment. The design specifications were set in the 
following three types; the standard lens 
specifications were f = 50 mm, F = 3.0/ and 2w = 46"*; 
the telephoto lens specifications were f = 135 mm, F = 
2.8, and 2w = 18.2"*; the wideangle lens specifications 
were f = 20 iran, F = 5.6, and 2w = 92*". As in the case 
of the lens system of three lenses, the search was 
started from lenses Ll to L4 of random configuration 
and the search was ended after one million evaluations . 
Lens configurations and spot diagrams of resultant 
optical systems are shown in each of Fig. 33 to Fig. 
35. Fig. 33 shows optical systems obtained in the case 
of the standard lens specif ications , Fig. 34 optical 
systems obtained in the case of the telephoto lens 
specifications, and Fig. 35 optical systems obtained 
in the case of the wideangle lens specifications. 

Iji Experiment 1 of the first embodiment, ten 
trials all resulted in obtaining the lens system 
having the lenses, i.e., a triplet in the 
configuration as shown in Fig. 31. The triplet 
considered to be empirically optimal was obtained by 
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starting the search from the fairly random lens t 
systems without relying on knowledge. It can be said 
from this result that the effectiveness of the global 
search of lens system by the GA was verified. 
5 In Experiment 2 the lens systems were also 

achieved with relatively high performance under the 
respective design specifications. For either of the 
telephoto lens and wideangle lens^ almost identical 
configurations were obtained in each trial (two 
10 patterns each), as shown in Fig. 34 and Fig. 35. For 

the standard lens, lenses of various configurations 
(six patterns) were obtained as shown in Fig. 33. 
Second e mbodiment by the designing method of optical 

system 

15 Next, the multi-objective optimization by the GA 

will be described as the second embodiment according 
to the present invention. 

In this second embodiment two objectives of the 
distortion and resolution are set explicitly and 

20 effectiveness will be verified concerning the multi- 

objective optimization capability of the GA. In this 
second embodiment the generation alteration model is a 
model based on the non-Pareto solution selection 
strategy (Shigenobu Kobayashi, Koji Yoshida, and 

25 Masayuki Yamamura: Generation of Pareto optimal 

setting tree set by GA, Jinkochinou Gakkai Shi, vol. 
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11^ No, 5, pp- 778-785 (1996)) (see Fig. 36). 

A crossover is carried out with n pairs randomly 
selected from a population. Reproduced children are 
merged with the existing population and individuals 
5 not being Pareto solutions are weeded out, so that a 

population of the next generation is composed of only 
Pareto solutions. A Pareto solution is a solution 
superior in at least one evaluation criterion to the 
all other solutions. This operation allows the multi- 

10 objective optimization to be conducted explicitly 

without setting the tradeoff ratio between the 
resolution and distortion. 

In this second embodiment the effectiveness of 
the multi-objective optimization by the GA will be 

15 verified by an experiment with an object of the lens 

system of three lenses. An initial population prepared 
includes five Pareto solutions obtained in the search 
by the single-objective GA, the number of crossovers n 
is 40, and the number of evaluations is 1,600,000 (the 

20 search is carried on up to the evaluation number of 

1,600,000). The design specifications of the lens 
system ,are the same as in Experiment 1 of the first 
embodiment described above. Fig. 37 shows plots of the 
resultant Pareto solution set on the distortion- 

25 resolution plane. The number of resultant Pareto 

solutions is 750. 
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Also in this second embodiment, if the search is 
further carried on from the Pareto solution set shown 
in Fig. 37, the Pareto solution curve will be evolved 
toward the left bottom corner. However, the rate of 
evolution is already slow, and it is thus assumed that 
the cuirve is sufficiently close to a true Pareto 
optimal solution curve. It also became apparent that 
the resolution was improved up to a certain value with 
no penalty of distortion but a further improvement in 
the resolution over that value was hard even with a 
penalty of distortion. Configurations of resultant 
solutions all are triplets and it is thus considered 
that the triplets are optimal in the entire region 
under the design specifications. 

"Fig. 38 shows a state in which the best solution 
P (the l^ns system shown in Fig. 31) obtained in 
Experiment\l of the first embodiment described above 
is plotted orl the enlarged view of Fig. 37. In the 
drawing letter\S indicates lens systems dominating the 
solution found bV the single-objective optimization of 
the evaluation cri^ria. As also apparent from this 
Fig. 38, it is clear Jiy seen that the second embodiment 
(multi-objective optimrzation) obtains many more 
excellent solutions than^hat obtained by the single- 
objective GA. This conceivably suggests that there is 
a possibility of making the problem harder if the 
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multi-objective problem is forced to be the single- 
object ive\ problem. 

It seems more natural that a plurality of lenses 
in the tradeoff relation are first obtained by a 
5 search and an appropriate lens is selected therefrom 

according to judgment of selection favored by the 
designer, rather than the way of converting the 
problem to the single-objective problem by linear 
combination of plural evaluation criteria and 

10 obtaining one lens by a search* 

As described above, the embodiments of the 
designing method of optical system according to the 
present invention clarified the effectiveness thereof 
by the above experiments. Particularly, the multi- 

15 objective GA handling the evaluation criteria 

explicitly permits us to search lenses with different 
performances at a time. 

Since the designing method of optical system 
according to the present invention employs the 

20 principle of successively selecting near-optimal 

solutions and finally outputting the optimal solution 
from the two features of the principle of the genetic 
algorithm, i.e., (1) or (la), and (2) below, a near 
solution or the optimal solution can be generated 

25 efficiently according to the length of turnaround 

computation time. 


93 


NIK97-13US 


(1) Simultaneous progress of local search and 
global search in a problem space by only the crossover 
operator 

(la) Simultaneous progress of local search and 
global search in a problem space by the combination of 
the crossover operator with the mutation operator 

(2) Alteration of generation for defining a 
series of operations with genetic operators as one 
generation and repetitively effecting the series of 
operations on finite genes including the elitist gene 
of each generation 

A chromosome in the genetic algorithm is 
represented by a string having a length corresponding 
to the number of parameters (a variable string 
consisting of variables of genes), in which a gene at 
a position (locus) of occurrence of each parameter 
corresponds to each parameter featuring the lens 
system to be designed. 

The crossover operator acts to make a gene of a 
child from genes of a pair of two parents in such a 
manner that, as to each parameter, the parameter of 
the gene of the child takes a value occurring at a 
predetermined probability in a partial space defined 
by parameters of the two parents. 

When the mutation operator is also used in 
addition, a gene of a child individual is generated in 
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such a manner that, as to each parameter, a parameter 
value is a value. occurring according to a 
predetermined continuous occurrence probability 
distribution of occurrence probabilities increasing 
5 with approaching a parameter value of one parent 

arbitrarily selected. 

Since candidates of solutions generated 
internally can be always evaluated directly, a 
solution not satisfying the constraint arbitrarily 

10 given is allowed to become extinct. Thus, the multi- 

objective evaluation can be done even in the case of 
plural objective functions corresponding to the 
evaluation criteria. 

Further, during execution of the GA, the 

15 alteration of generation is made so that the elitist 

solution (or Pareto optimal solution) out of a 
plurality of genes (satisfying the arbitrary 
constraint) stored is always preserved and selection 
is made so that better solutions are more unlikely to 

20 become extinct and thus are left in the next 

generation, thus enabling multi-point simultaneous 
searches with high efficiency. 

Third embodiment by the designing method of optical 
system 

25 The third embodiment in the designing method of 

optical system according to the present invention will 
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be described referring to Fig. 39 to Fig. 41. This 
third embodiment is an example for obtaining the 
optimal solution of a lens system for photography ^ 
which is a lens system of three lenses the refractive 
indices of which are known. Namely , the refractive 
indices of the three respective lenses are set by the 
user to be constraints. 

^g. 39 is a schematic diagram of the structure 
r of the plrotographic lens system. In this figure g 
10 designates^he image plane. The photographic lens 

system of thrs figure is an example of the three-lens 
conf iguration^ \in which there are six boundary 
surfaces of a toNg having their respective curvatures, 
and six distances of dl to d6 between the boundary 
15 surfaces (dl between\A and B, d2 between B and d3 

between C and D, d4 between D and d5 between E and 
F/ and d6 between F andX^). 

In the case of the photographic lens system of 
Fig. 39 the refractive indices of spaces between the 
20 lens boundary surfaces are preliminarily given , but if 

degrees of freedom are also given to the refractive 
indices of the spaces between the lens boundary 
surfaces the number of parameters will increase by the 
degrees. Since the field angle and overall focal 
25 length of the lens optical system are normally 

constraints given by the designer, once six (e.g., a- 
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f) are determined out of a-g and five (e.g., dl-d5) 
out of dl-d6/ the two remaining parameters (e.g., g 
and d6) are determined naturally. This third 
embodiment becomes a simultaneous optimization problem 
of ten parameters accordingly. 

The distances between the boundary surfaces 
themselves are used for dl to d6 (i.e., the necessary 
conditions are dl-d6 > 0 in order to avoid physical 
interference) and an inverse of a radius of curvature 
is used for indicating each curvature of a-g. The 
reason why the inverse numbers are used is that use of 
inverse allows us to handle the change in curvature as 
continuous change on the parameter space, thereby 
being ready for transition of surface configuration 
into the both convex and concave spaces with small 
change of a flat surface (having the radius of 
curvature of the infinity) . 

There are a variety of evaluation criteria for 
lens system and, corresponding thereto, a variety of 
performance functions can be defined. Famous criteria 
are five evaluation criteria represented by the 
Seidel's five aberrations, which can also be applied 
to this embodiment. For brevity of description, 
however, this embodiment shows an example using two 
conflicting evaluation criteria, using the ray tracing 
by spot diagrams . 
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Fig. 40 illustrates a photographic lens system 
generally called a triplet type. In the drawing the 
left side is the object side while the right side is 
the image side. When light is assumed to come from the 
5 infinity on the object side and to be represented by a 

parallel beam, this beam will be converged at one 
point on the image plane g. An image formed on the 
entire image plane can be evaluated by consideration 
on typical beams coming from the left of the same 

10 drawing, including a parallel beam parallel to the 

optic axis of the lens system (rays indicated by solid 
lines in the drawing), a parallel beam an angle of 
which to the optic axis is the maximum field angle in 
the specifications of the lens system (rays indicated 

15 by dashed lines in the drawing), and a parallel beam 

an angle of which to the optic axis is approximately a 
half of the maximum field angle in the specifications 
of the lens system (rays indicated by chain lines in 
the drawing ) . 

20 For example, let us consider the degree of this 

convergence as an evaluation criterion and evaluate 
the degree of convergence by dispersion of image-plane 
arrival points of three rays> the paraxial ray of the 
above beams, a ray on the maximum outside diameter 

25 (normally, a diameter corresponding to the diameter of 

the entrance pupil) of the above beams, and rays on 
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intermediate diameters (for example, diameters equal 
to approximately 70 % and 80 % of the maximum outside 
diameter) between the two rays, from the image-plane 
arrival position of the paraxial ray. This evaluation 
criterion approximately stands for the resolution of 
the image formed on the image plane, if the sagittal 
aberration, coma, and flare are ignored. When this 
dispersion is checked for the above three types of 
parallel beams, approximate evaluation can be made as 
to the resolution of the formed image. 

For example, let us consider another evaluation 
criterion of measuring a deviation between a position 
where the principal ray actually reaches the image 
plane and a position where it should impinge 
theoretically, for each of the above three types of 
parallel beams . An evaluation value is obtained by 
overall evaluation of deviations from the theoretical 
values of the three principal rays obtained 
corresponding to the respective principal rays (for 
example, by measuring the standard deviation from the 
mean value of the deviations) and it approximately 
corresponds to the distortion of the formed image. 

When the multi-objective optimization is carried 
out by using two different evaluation criteria and 
minimizing the both two objective functions 
corresponding to the two evaluation criteria 
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(resolution and distortion) as described above, it 
will result in obtaining many multi-objective optimal 
solutions (Pareto optimal solutions)/ including 
solutions in which one of values of the two objective 
5 functions is very small while the other is large, and 

solutions in which the both are moderately small. 

In the embodiment of the designing method of 
optical system according to the present invention, the 
genetic operator in the genetic algorithm being one of 
10 the evolutionary computation methods is the crossover 

operator for directly operating continuous values, or 
the combination of the crossover operator for directly 
operating continuous values with the mutation operator 
for directly handling continuous values. 
15 ^^^^ Xig- 41 illustrates a gene representation of ten 

^ jf^^-^/^ paramete^ of continuous values featuring the lens 

system in uhe three-lens configuration shown in Fig. 
39. In eachlof a-g and dl-d5 in the same drawing a 
parameter of tWe corresponding lens system is stored 
20 in the form of continuous value. Among such genes, n 

(n > 1) genes satisfying the minimum constraints are 
reproduced arbitrarily 

Then the crossover operator for directly 
handling continuous values is applied to a pair of two 
25 genes selected properly. With the crossover operator, 

an occurrence probability distribution of new-born 
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genes is set on a space of a parameter (a) from 
corresponding parameters in two genes {for example, 
let avl be a value of the parameter a of one gene and 
av2 be a value of the parameter a of the other gene) . 
5 This occurrence probability distribution 

normally has a configuration in which occurrence 
probabilities become higher with getting near avl and 
av2. Then selected from the partial space defined by 
this occurrence probability distribution are two 

10 values of the parameter a (for example, av3 and av4) 

that occur according to the occurrence probabilities 
and that a new-born gene should have. This operation 
is effected on the all parameters (the ten parameters 
in the example of Fig. 41). 

15 This third embodiment can also employ a method 

for handling the individual parameters independently 
of each other or a method for simultaneously selecting 
some parameters or the all parameters in consideration 
of correlation between parameters as in the NDX. In 

20 this way a pair of two genes are sampled with 

replacement Pc x n times according to a preset 
crossover rate Pc from a population of n genes so as 
to be modified so that a gene of better evaluation 
result is more likely to be selected. The crossover 

25 operator is again applied to the sampled gene 

population. 
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This operation increases the number of genes in 
the gene group from n to n x (1 + 2Pc). In the case of 
a single performance function (i.e»/ in the case of a 
single evaluation criterion or in the case of a 
5 plurality of evaluation criteria being combined so as 

to be expressed by one performance function) , a gene 
of the best evaluation value is always left and 
sampling without replacement is carried on so that 
genes of better evaluation values are more likely to 

10 be selected and left, before the nximber of genes in 

the gene group reaches n. The other 2 x n x Pc genes 
are eliminated. A population of n new genes is 
reproduced in this way. The population of genes 
reproduced in this way is called the first generation 

15 of genetic algorithm. Repetition of such generation 

alteration will find the optimal solution sooner or 
later if a sufficient computation time is permitted. 
Unless a sufficient computation time is given, a 
solution with good quality consistent with the 

20 computation time can be generated efficiently. 

When the multi-objective optimization is carried 
out with plural performance functions, only multi- 
objective optimal genes are left out of the population 
of the all (n X (1 + 2Pc)) genes after generation of 

25 new-born genes (or the multi-objective optimal genes 

are always left while genes being not multi-objective 
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optimal and numbering k (k > 1) times the number of 
the multi-objective optimal genes are also left), and 
the other genes are allowed to become extinct, thus 
forming a gene population of a new generation. In this 
case, however, care is needed, because the number of 
genes n in the gene population varies generation from 
generation. 

As the genetic operator for directly handling 
continuous values, the mutation operator for directly 
handling continuous values can also be used, as well 
as the crossover operator for directly handling 
continuous values. In this case, genes are randomly 
sampled with replacement Pm x n x ( 1 + 2Pc) times 
according to the mutation rate Pm preliminarily set, 
from the population of (n x (1 + 2Pc)) genes after the 
application of the crossover operator. Then new-born 
genes being mutants are reproduced so that an 
arbitrary (one or more) parameter of the genes is 
altered arbitrarily (or in a probability distribution 
biased to the neighborhood of its original value) . 
Immediately after the application of the mutation 
operator, the number of genes in the gene group is n x 
(1 + Pm) X (1 + 2Pc). Then genes to be left in the 
next generation are selected out of this gene 
population in the same manner as above, thus 
generating the gene population of the next generation. 
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The above embodiments were described using the 
typical parameters represented by the radii of 
curvatures of the boundary surfaces and the distances 
between the boundary surfaces, but it is needless to 
5 mention that the optimal solution can be gained with 

additional parameters of continuous values related to 
the lens designing , such as the refractive indices of 
the three respective lenses and the pressure between 
the lenses. 

10 As described above, the designing method of 

optical system according to the present invention 
employs the principle of successively selecting near- 
optimal solutions and finally presenting the optimal 
solution from the two features of the principle of the 

15 genetic algorithm, and, therefore, a near solution or 

the optimal solution can be reproduced efficiently 
according to the length of turnaround computation time. 
Fourth embodiment by the designing method of optical 

system 

20 The fourth embodiment in the designing method of 

optical system according to the present invention will 
be described referring to Fig. 42 and Fig. 43. 

In this fourth embodiment Fig. 39 explained 
previously will be used as a figure for defining 

25 parameters. Fig. 42 is a drawing to show the flowchart 

for explaining procedures of the designing method of 
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optical system according to the fourth embodiment and 
Fig. 43 is a drawing to schematically show the 
operations in the fourth embodiment. 

The individual of the optical system shown in 
Fig. 39 has the parameters including the curvatures a- 
f of the lens surfaces (boundary surfaces) of the 
respective lenses L1-L3, the surface-to-surface 
distances (distances between the boundary surfaces) AB 
= dl, BC = d2^ CD = d3, DE = d4, EF = d5, FG = d6, the 
refractive indices of media between the lens surfaces 
nab = nl, nbc = n2, ncd = n3 , nde = n4^ nef = n5, nfg 
= n6, and Abbe's numbers vab = vl/ vbc = v2 , vcd = v3 , 
vde = v4/ vef = v5, vfg = v6 . 

A chromosome of this optical system is 
represented by a string of genes of -a-b-c-d-e-f-dl- 
d2-d3-d4-d5-d6-nl-n2-n3-n4-n5-n6-vl-v2-v3-v4-v5-v6- . 

In this example the curvatures a-f of the 
respective lens surfaces are continuous values and the 
surface-to-surface distances dl-d6 are also continuous 
values. In the above chromosome of the fourth 
embodiment values (alleles) of the genes corresponding 
to the curvatures and surface distances are continuous 
values . 

Next, the operation of the fourth embodiment 
will be described referring to Fig. 42 and Fig. 43. 
Here is described an example in which/ for handling 
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the curvatures a-f of the respective lens surfaces and 
the surface-to-surface distances dl-d6 in common 
dimensions, each parameter is first normalized in the 
range of -2.048 to 2.048. This makes influence on each 
ray continuously equivalent. For example, let us 
consider a lens system characterized by the following 
parameters . 


Curvature Lens distance Refractive index 
(LI) 0.0100 8 1.5 
10 (L2) -0.0083 5 1.8 
(L3) -0.0077 


When in^the above lens system the curvature of 
the lens surface (the final surface) located closest 
to the image plane is changed (bent) in order to 

15 obtain a desired focal length, the above lens system 

is represented in the form of a vector of at most four 
dimensions or less. When the final surface is not bent, 
the above lens system is represented in the form of a 
vector of at most five dimensions or less. 

20 When the above lens system is expressed in the 

form of a five-dimensional vector without 
normalization (note that the refractive indices are 
fixed values), the vector is (0.0100, -0.0083, -0.0077, 
8, 5). For normalizing each parameter (each vector 

25 component), the maximum and minimum need to be set as 

to at least the curvatures and lens distances (glass. 
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air) . 

This setting of maximum and minimum is done by 
the user (designer). For example, the curvatures are 
set in the range of -0.5 to +0.5/ the surface 
distances with glass in between as a medium 
(corresponding to the thicknesses of lenses) are set 
in the range of 0.1 to 50, and the surface distances 
with air in between as a medium (the aerial distances 
between the lenses) are set in the range of 0.1 to 100. 
In this case, a normalized value of each parameter is 
defined as follows. 

(normalized value) = 4.096 x (value of 
parameter )/ (maximum of parameter - minimum of 
parameter ) 

Accordingly, the normalized vector is given as 
(0.04096, -0.0339968, -0.0315392, 0.6567, 0.41042). 

In this fourth embodiment 0 is first set as an 
initial value into the- parameter t indicating the 
generation (step STO). 

Then a plurality of lens data for the components 
of the above parameters a-f, dl-d6, nl-n6, and vl-v6 
are reproduced to form an initial population ( step 
STl ) . In this step random numbers are used for 
generation of plural parameters to form the initial 
population. On this occasion each parameter made of a 
random number is normalized as described above. If an 
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optical system characterized by these parameters (a 
candidate of an optical system to be designed) 
satisfies predetermined constraints and if all rays 
incident into this optical system can reach the image 
5 plane, the parameters for this optical system are used 

as an initial population. The above constraints are 
preferably, for example/ conditions for eliminating 
optical systems that cannot exist physically (for 
example, optical systems having a negative surface-to- 

10 surface distance) (for efficient search). Such 

constraints can be set by the designer himself or 
herself on the occasion of setting the maximums and 
minimums of the curvatures and the other parameters. 

Since in this fourth embodiment the field angle 

15 and overall focal length of the optical system are the 

constraints given by the designer, values satisfying 
the above constraints, obtained by tracing of paraxial 
rays in the optical system, are used for the curvature 
f of the lens surface closest to the image plane in 

20 the optical system and the distance d6 from the lens 

surface closest to the image plane thereto. In this 
fourth embodiment the refractive indices of the 
respective lenses Ll to L3 are given preliminarily and 
the medium between the lenses L1-L3 is the air. Thus, 

25 the chromosome of the optical system in this 

embodiment is given as -a-b-c-d-e-dl-d2-d3-d4-d5- . 
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In this case, the above initial population 
includes a plurality of individuals having different 
chromosomes • 

Subsequently, 1 is added to the generation 
5 parameter t (step ST2) , and thereafter, as shown in 

Fig. 43, a set of three parents Pal, Pa2, Pa3 are 
selected from the above initial population (step STB). 
If t > 2, a set of three parents Pal, Pa2, Pa3, are 
not selected from the initial population, but from a 

10 population of the generation (t - 1) one before the 

current generation (t). 

In step ST4 new genes are reproduced from the 
parents Pal, Pa2, Pa3 selected in above step 3. Namely, 
the crossover is effected. The crossover operator in 

15 this fourth embodiment is the UNDX (Uni-modal Normal 

Distribution Crossover) out of the genetic operators 
for directly handling continuous values. Therefore, in 
the following description, the operation in step ST4 
with application of UNDX will be described referring 

20 to Fig. 44. 

First, considering a space of n dimensions 
corresponding to the number of genes that the 
chromosomes of individuals have, each individual is 
expressed by a vector of the n dimensions. In this 

25 fourth embodiment, since a chromosome is composed of 

the ten genes -a-b-c-d-e-dl-d2-d3-d4-d5- as described 
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above / n = 10 and each individual is expressed by a 
vector of the ten dimensions (a, b^ o, e, dl, d2, 
d3 , dA, d5). In Fig. 44 each individual is represented 
as a three-dimensional vector for illustration's sake. 

Substep ST4-1 

First, as shown by 441 in Fig. 44, substep ST4-1 
is carried out to set points VCl, VC2, VC3 on the 
space, corresponding to the parents Pal, Pa2, Pa3 
selected in step ST3 . Here, let M be a middle point 
between the point VCl and the point VC2 and H be the 
length of a perpendicular from the point VC3 to a line 
segment VC1-VC2. 
Substep ST4-2 

Next, as shown by 442 in Fig. 44, substep ST4-2 
is carried out to reproduce normal random deviates 
with the middle point as an expectation and oa as a 
standard deviation. According to the normal random 
deviates, a point PI is reproduced on the segment VCl- 
VC2 (the point Pi is generated on the segment VC1-VC2 
in a probability according to such a normal 
distribution that the middle point M is at the vertex). 
Substep ST4-3 

Further, as shown by 443 in Fig. 44, substep 
ST4-3 is carried out to generate a probability space 
Ex of an n-dimensional normal distribution according 
to n normal random deviates with the point PI as an 
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expectation and ob as a standard deviation. In this 
embodiment the probability space is the one of ten- 
dimensional normal distribution. 
Substep ST4-4 

5 As shown by 444 in Fig. 44^ this substep ST4-4 

is carried out to reproduce a point P2 according to 
the n-dimensional normal random deviates having the 
probability space Ex of the n-dimensional normal 
distribution reproduced in above substep ST4-3. Namely^ 

10 the point P2 is reproduced in the probability 

according to the n-dimensional normal distribution. 
Substep ST4-5 

As shown by 445 in Fig. 44/ this substep ST4-5 
is carried out to set a hyperplane Jt perpendicular to 

15 the vector VC1-VC2/ including the point PI. The 

hyperplane n is a plane of the (n - 1) dimensions 
obtained by subtracting 1 from the number of 
dimensions of the space. In this embodiment the 
hyperplane is of nine dimensions. A point P3 is 

20 defined at a point where a perpendicular from the 

point P2 to the hyperplane ji falls. 
Substeg ST4-6 

As shown by 446 in Fig. 44, this substep ST4-6 
is carried out to generate a point P4 capable of 

25 constituting a vector P1-P4 having a component 

parallel to a vector P1-P3 starting from the point Pl 
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and having a length according to normal random 
deviates with the point Pi as an expectation and ac as 
a standard deviation • Here, ac is proportional to the 
n-th power root of a distance H-VC3 between the point 
H and the point VC3 in step ST4-1. 

^he n-dimensional coordinates of the point P4 
reproduc^ by above steps ST4-1 to ST4-6 correspond to 
the n parameters a, b, c, d, e, t, dl, d2, €L3 , d4/ d5 
of a chromosOT^e of a new-born gene or a child. In this 
step ST4 of theXfourth embodiment the substeps ST4-1 
to ST4-6 describedVabove are repeated m times, whereby 
m new genes are repi^^uced from the three parents Pal, 
Pa2, Pa3. ^ 

After completion of above step ST4 , evaluation 
values are calculated for the set (family set) of the 
m new-born genes reproduced in this step ST4 and the 
two parents Pal, Pa2. In this fourth embodiment the 
evaluation values (fitness values) are expressed by a 
single evaluation criterion or by one performance 
function <j> consisting of a set of plural evaluation 
criteria. For example, when there are k evaluation 
criteria abe(l), abe( 2 ),..., abe(k), they can be 
linearly combined with weights on the respective 
evaluation criteria abe ( 1 ) , abe ( 2 ) , • • • , abe ( k ) • Namely , 
the performance function <|> is given as ((» = Wl*abe(l) + 
W2*abe(2) +...+ Wk*abe(k). Then an individual with the 
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best evaluation value and an individual selected by 
roulette selection are selected from the above family 
set (step ST5) . 

The roulette selection is a technique for 
selecting an individual in a predetermined probability 
in proportion to a fitness value (evaluation value) of 
each individual or in proportion to the ranking of the 
fitness value. For example, let us consider an excuaple 
of four individuals A, B, C, D. In the former fitness 
value proportion method, supposing the fitness value 
of individual A is 40, that of individual B 60, that 
of individual C 100, and that of individual D 200, the 
probability of selection of individual A is 10 %, that 
of individual B 15 %, that of individual C 25 %, and 
that of individual D 50 % (A:B:C:D = 10 : 15 :25 : 50 ) • In 
the latter fitness value ranking proportion method, 
the probability of selection of individual A is 10 %, 
that of individual B 20 %, that of individual jC. . 30,.,% , 
and that of individual D 40 % (A:B:C:D = 1:2:3:4). The 
fourth embodiment employs the latter fitness value 
ranking proportion method (ranking method), but it 
should be noted that the invention is not limited to 
this method. 

Further, in step S6 the individuals except for 
the two individuals selected in above step S5 are 
allowed to become extinct (selection). Then the 
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selected individuals replace the parents Pal, Pa2 in 
the initial population. This replacement of the 
parents in the population by the selected individuals 
is called the alteration of generation. 

Subsequently in step ST7, an individual having 
the best evaluation value is outputted out of the 
individuals in the population after the alteration of 
generation (presentation of the best solution) and the 
operation of step ST2 is repeated. 

Here, above steps ST2 to ST7 are called the 
first generation of the genetic algorithm. In the 
second generation and after, three parents Pal, Pa2/ 
Pa3 are selected from the population after the 
alteration of generation. 

Repetitive operations of such alteration of 
generation will automatically reproduce an optical 
system as a globally optimal solution independent of 
the initial solution given. 

The fourth embodiment described above uses only 
the crossover operator , but the mutation operator may 
also be used in addition to this crossover operator. 

An example of application of the mutation 
operator in addition to the crossover operator will be 
described. In this fourth embodiment above steps ST5 
and ST6 select genes to be left in the next generation 
from the set (family set) of the m new-born genes and 
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the two parents Pal, Pa2 . 

In the case of the mutation operator being 
applied/ new chromosomes are sampled at random Pm x (m 
+ 2) times according to the preset mutation rate Pm 
out of the set (family set) of the m new-born genes 
and two parents Pal/ Pa2. Then new genes of mutants 
are reproduced by changing an arbitrary (one or more) 
chromosome out of the genes in the sampled chromosomes 
on an arbitrary basis (or according to a probability 
distribution biased to the neighborhood of its 
original value). After that/ genes to be left in the 
next generation are selected from the set of the m 
new-born genes, the new-born genes reproduced as 
mutants / and the two parents Pal/ Pa2. Namely/ the two 
parents Pal/ Pa2 in the population are replaced by an 
individual of the best evaluation value and an 
individual selected by roulette selection from this 
set. 

In this fourth embodiment it is also possible to 
apply only the mutation operator without application 
of the crossover operator. In this case/ instead of 
above steps ST3 to ST6/ new chromosomes are sampled at 
random by the number of times according to the 
mutation rate Pm out of the population consisting of 
the chromosomes corresponding to the plural lens data. 
Then mutants are generated by changing an arbitrary 


115 


NIK97-13US 


(one or more) gene of the sampled chromosomes 
according to a probability distribution biased to the 
vicinity of its original value. Then chromosomes to be 
left in the next generation are selected from this set 
of mutants. 

Next described is an example of the multi- 
objective optimization in the fourth embodiment. In 
steps ST5 and ST6 of the fourth embodiment^ one 
performance function 4> is used at the time of 
selection of the individual of the best evaluation 
value and selection of the individual by roulette 
selection from the set (family set) of the m new-born 
genes and two parents Pal^ Pa2, as described above. 
These two individuals selected then replace the two 
parents Pal, Pa2 in the population. This means that 
even cases of presence of plural evaluation criteria 
are handled in the form of a single performance 
function by linearly combining these evaluation 
criteria. 

Ih. the case of the multi-objective optimization, 
steps Sll a^d S15-S17 below are executed in place of 
above steps STi and ST5-ST7. 

The operation of the multi-objective 
optimization will be described using the flowchart 
shown in Fig. 45 and the schematic diagram of 
operations shown in Fig. 46. 
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Step STllO is different from above step STl in 
that evaluation values of plural performance functions 
<|>l-<|>k are calculated according to a plurality of (k) 
evaluation criteria of plural individuals (lens data) 
5 in the initial population. 

Next, step ST150 is a step of calculating each 
of evaluation values of plural performance functions 
<t)l-<(>k for each individual in the set (family set) of 
the m new-born genes reproduced in above step ST4 and 

10 the two parents Pal, Pa2. Then individuals not being 

Pareto optimal (non-Pareto solutions) are selected 
from these evaluation values. Here, a Pareto optimal 
solution means, under presence of plural evaluation 
criteria, a solution superior in at least one 

15 evaluation criterion out of the plural evaluation 

criteria to the other solutions. 

Subsequently, step STl 60 is carried out to weed 
out individuals not being Pareto optimal. This step 
eliminates individuals inferior to the other 

20 individuals in the all evaluation functions among the 

evaluation values of the plural performance functions 
4»l-4>k. The Pareto optimal individuals left herein 
compose a population of the next generation and in the 
next generation parents for the crossover operator are 

25 selected from this population. 

After completion of step STl 60, step STl 70 is 
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carried out to output the Pareto optimal individuals 
obtained in above step ST160^ and then the steps are 
repeated in order from step ST2, 

After that, steps ST2-ST4 and steps S150-S160 
5 are repeated in the same manner, thereby effecting the 

multi-objective optimization of individuals in the 
population. 

Also in the case of the multi-objective 
optimization as described, the mutation parameter 
J5 10 described above can also be applied in addition to the 

=5 crossover operator. Further, the mutation parameter 

can also be applied instead of the crossover operator. 
J-^ . The execution of the above steps (steps STllO, 

L ST2-ST4, and ST150-ST170) peirmits the multi-objective 

'.•.ST 

15 optimization to be effected even with presence of 

plural evaluation criteria for optical system and 
without placing the tradeoff ratio (weights) on these 
evaluation criteria. This results in always presein?i.ng 
the elitist individual out of plural chromosomes in 

20 the population and selecting better individuals to be 

left in the next generation according to such 
predetermined probabilities that better individuals 
are more unlikely to be weeded out, thus enabling the 
multi-point simultaneous search with efficiency. 

25 In the Pareto optimization described above, the 

number of individuals in the population differs 
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depending upon the generation. The Pareto optimization 
described above has such an advantage that the 
population size does not become too large ^ because 
non-Pareto solutions are weeded out in order. 

The fourth embodiment described above employed 
the UNDX as a crossover operator, but, instead thereof, 
BLX-a or NDX can also be applied. 

It can also be contemplated that, for example, 
some of plural evaluation criteria are linearly 
combined to form plural performance functions and 
these performance functions are subjected to the 
multi-objective optimization. 

Described below are application examples where 
the designing method of optical system according to 
the fourth embodiment is applied to an optical system 
consisting of fifteen lenses. 
Application 1 

Application 1 of the fourth embodiment is to 
design a projection optical system for transferring a 
circuit pattern on a mask onto a wafer at a 
demagnif ication ratio . 

In this Application 1, the evaluation criteria 
of the optical system are those (1) to (16) listed 
below and the performance functions are those obtained 
by linearly combining some evaluation criteria. 
(1) Spherical aberration at the maximum 
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numerical aperture 

(2) Spherical aberration at 75 % of the maximum 
numerical aperture 

(3) Sagittal image surface at the maximum object 

height 

(4) Meridional coma of upper rays at the maximum 
object height and at 70 % of the maximum numerical 
aperture 

(5) Meridional coma of lower rays at the maximum 
object height and at 70 % of the maximum numerical 
aperture 

(6) Meridional coma of upper rays at the maximum 
object height and at 50 % of the maximum numerical 
aperture 

( 7 ) Meridional coma of lower rays at the maximum 
object height and at 50 % of the maximum numerical 
aperture 

(8) Meridional coma of upper rays at 75 % of the 
maximum object height and at 70 % of the maximum 
numerical aperture 

(9) Meridional coma of lower rays at 75 % of the 
maximuir\ object height and at 70 % of the maximum 
numerical aperture 

(10) Meridional coma of upper rays at 75 % of 
the maximum object height and at 50 % of the maximum 
numerical aperture 
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(11) Meridional coma of lower rays at 75 % of 
the maximum object height and at 50 % of the maximum 
numerical aperture 

(12) Meridional coma of upper rays at 50 % of 
the maximum object height and at 70 % of the maximum 
numerical aperture 

(13) Meridional coma of lower rays at 50 % of 
the maximum object height and at 70 % of the maximum 
nvunerical aperture 

(14) Meridional coma of upper rays at 50 % of 
the maximum object height and at 50 % of the maximum 
numerical aperture 

(15) Meridional coma of lower rays at 50 % of 
the maximum object height and at 50 % of the maximum 
numerical aperture 


conditions) of (17) to (19) listed below are given. 

(17) A distance from the lens surface closest to 
the reticle thereto, WD > 25 

(18) Magnification of the overall system, P = - 
3.0 (in the ray tracing from the wafer side to the 
reticle side) 

(19) Distances between the lens surfaces, d < 25 
In this Application 1, parameters to be altered 

in designing of the optical system are the curvatures 


(16) Distortion at the maximum object height 


In this Application 1 constraints (restraint 
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of the respective lens surfaces and surface distances. 
The refractive index n of glass materials of the lens 
elements is preliminarily given as n = 1.56384. 

Under the above preconditions, the parameters in 
the designing method of optical system according to 
the fourth embodiment are set as listed below, 
ize of initial population: 50 
Number of crossovers: 300,000 
Numbet of children generated by crossover 
10 operator: 20 

aa of UND^ 0.5 x VCIVC 
Ob of UNDX: 

oc of UNDX: 0.^5 x (VC1VC2)'^" 
The projection optical system represented by 

15 Table 1 below was generated by above Application 1 . In 

Table 1, p is the magnification from the reduced side 
(the wafer side) to the enlarged side (the reticle 
side), L the conjugate length (object-image distance) , 
NA the reduced-side numerical aperture, and WD the 

20 distance from the lens surface closest to the reticle 

thereto. In Table 1 below, numerals in the left end 
column ^represent surface numbers from the wafer 
surface, r radii of curvature, d surface distances, 
and n indices of refraction. 


25 TABLE 1 

P = -3.0 
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Fig. 47 is a cross-sectional view of the 
projection optical system represented in above Table 1 
and Fig. 48 is an aberration diagram to show 
aberrations thereof. Here, 481 in Fig. 48 is a 
spherical aberration diagram, 482 in Fig. 48 is an 
astigmatism diagram, 483 in Fig. 48 shows the 
meridional coma at the maximum object height, 484 in 
Fig. 48 the meridional coma at 75 % of the maximum 
object Jheight, 485 in Fig. 48 the meridional coma at 
50 % of the maximum object height, 486 in Fig. 48 the 
meridional coma on the optic axis, 487 in Fig. 48 the 
sagittal coma at the maximum object height, 488 in Fig. 
48 the sagittal coma at 75 % of the maximum object 
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height, 490 in Fig. 48 the sagittal coma at 50 % of 
the maximum object height, 491 in Fig. 48 the sagittal 
coma on the optic axis, and 492 in Fig. 48 a 
distortion diagram. In the astigmatism diagram of 482 
in Fig. 48 the dashed line indicates the meridional 
image surface and the solid line the sagittal image 
surface. In Fig. 48 NA denotes the numerical aperture 
and Y the object height. 

Fig. 49 is a cross-sectional view of the 
projection optical system as Application 2 where 
twenty four thousand crossovers were carried out under 
conditions that the size of initial population was 400 
and fifty children were generated. 

By the designing method of optical system in 
each of Applications 1 and 2 as described above, a 
good solution (individual) can be obtained even though 
the number of lens elements is large. 

Although the glass materials for the respective 
lenses were predetermined in above Application 1 and 
Application 2, it is needless to mention that they can 
be defined as parameters. On this occasion, since the 
glass neater ials often exist discretely, the genetic 
parameters used should be those for handling discrete 
values . 

As described above, the designing method of 
optical system according to the present invention has 
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the effect of generating the optical system as a 
globally optimal solution independent of the initial 
solution given. Since the present invention utilizes 
the genetic parameters for handling the continuous 
5 values explicitly, child individuals of the next 

generation generated from parent individuals can 
succeed to characters (properties) of the parent 
individuals and this succession to characters is 
repeated/ thus achieving the effect of eliminating 

10 wasteful searches and thus obtaining the optimal 

solution or the near-optimal solution within a shorter 
time. Further / the designing method of optical system 
of the present invention also has the effect that a 
plurality of optical systems can be generated 

15 simultaneously as multi-objective optimal solutions 

that must exist corresponding to a plurality of 
conflicting evaluation criteria. 
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